(54) Title: ISOLATION OF FIVE NOVEL GENES CODING FOR NEW Fc RECEPTORS-TYPE MELANOMA INVOLVED IN 
THE PATHOGENESIS OF LYMPHOMA/MELANOMA 

(57) Abstract: This invention provides an isolated nucleic acid molecule which encodes an immunoglobulin receptor, Immunoglobulin 
superfamily Receptor Translocation Associated, IRTA, protein. Provided too, are the IRTA proteins encoded by the isolated nucleic 
acid molecules, IRTA1, IRTA2, IRTA3, IRTA4 or IRTA5 proteins, having the amino acid sequences set forth in any of Figures 18A, 
18B-1-18B-3, 18C-1-18C-2, 18D-1-18D-2 or 18E-1-18E-2. Oligonucleotides of the isolated nucleic acid molecules are provided. 
Antibodies directed to an epitope of a purified IRTA1, IRTA2, IRTA3, IRTA4 or IRTA5 protein are also provided, as are 
pharmaceutical compositions comprising such antibodies or oligonucleotides. Methods for detecting a B cell malignancy in a sample from 
a subject; diagnosing B cell malignancy in a sample from a subject; detecting human IRTA protein in a sample; and treating a subject 
having a B cell cancer are also provided. 
ISOLATION OF FIVE NOVEL GENES 
CODING FOR NEW Fc RECEPTORS-TYPE MELANOMA 
INVOLVED IN THE PATHOGENESIS OF LYMPHOMA/MELANOMA 

This application claims the priority of copending U.S. 
Provisional Application Serial No. 60/168,151, filed 
November 29, 1999, the contents of which are hereby 
incorporated by reference into the present application. 

The invention disclosed herein was made in the course of 
work under NCI Grant No. CA 44029 from the National Cancer 
Institute. Accordingly, the U.S. Government has certain 
rights in this invention. 

Throughout this application, various references are 
referred to in parentheses. Disclosures of these 
publications in their entireties are hereby incorporated by 
reference into this application to more fully describe the 
state of the art to which this invention pertains. Full 
bibliographic citation for these references may be found 
at the end of this application, preceding the claims. 

BACKGROUND OF THE INVENTION 

Abnormalities of chromosome 1q21 are common in B cell 
malignancies, including B cell lymphoma and myeloma, but 
the genes targeted by these aberrations are largely 
unknown. By cloning the breakpoints of a t(1;14)(q21;q32) 
chromosomal translocation in a myeloma cell line, we have 
identified two novel genes, IRTA1 and IRTA2, encoding cell 
surface receptors with homologies to the Fc and Inhibitory 
Receptor families. Both genes are normally expressed in 
mature B cells, but with different distributions in 
peripheral lymphoid organs: IRTA1 is expressed in marginal 
zone B cells, while IRTA2 is also expressed in germinal 
center centrocytes and in immunoblasts. As the result of 
the t(1;14) translocation, the IRTA1 signal peptide is 
fused to the Immunoglobulin Cα domain to produce a 
chimaeric IRTA1/Cα fusion protein. In Multiple Myeloma 
and Burkitt lymphoma cell lines with 1q21 abnormalities, 
IRTA2 expression is deregulated. Thus, IRTA1 and IRTA2 
are novel immunoreceptors with a potentially important 
role in B cell development and lymphomagenesis. 

B-cell Non-Hodgkin's Lymphoma (B-NHL) and Multiple Myeloma 
(MM) represent a heterogeneous group of malignancies 
derived from mature B cells with phenotypes corresponding 
to pre-Germinal Center (GC) (mantle cell), GC (follicular, 
diffuse large cell, Burkitt's), or post-GC B cells (MM) 
(for review, Gaidano and Dalla-Favera, 1997; Kuppers et 
al., 1999). Insights into the pathogenesis of these 
malignancies have been gained by the identification of 
recurrent clonal chromosomal abnormalities characteristic 
for specific disease subtypes. The common consequence of 
these translocations is the transcriptional deregulation 
of protooncogenes by their juxtaposition to heterologous 
transcriptional regulatory elements located in the partner 
chromosome (Gaidano and Dalla-Favera, 1997). These 
heterologous transcriptional regulatory elements can be 
derived from the Immunoglobulin (IG) locus or from other 
partner chromosomal loci. Examples include MYC in 
t(8;14)(q24;q32) in Burkitt's lymphoma (BL) (Dalla-Favera 
et al., 1982; Taub et al., 1982), the CCND1 gene 
deregulated by the t(11;14)(q13;q32) in mantle cell 
lymphoma (MCL) (Rosenberg et al., 1991) and multiple 
myeloma (MM) (Ronchetti et al., 1999), BCL2 involved in 
the t(14;18)(q32;q21) in follicular lymphoma (FL) (Bakhshi 
et al., 1985), BCL6 in t(3;14)(q27;q32) in diffuse large B 
cell lymphoma (DLCL) (Ye et al., 1993), as well as FGFR3 
in t(4;14)(p16;q32) (Chesi et al., 1997), MAF in 
t(14;16)(q32;q23) (Chesi et al., 1998) and MUM1/IRF4 in 
t(6;14)(p25;q32) (Iida et al., 1997) in multiple myeloma 
(MM). The identification of these oncogenes has offered 
valuable insights into the pathogenesis and diagnosis of 
their corresponding malignancies. 

Chromosomal abnormalities involving band 1q21-q23 are 
among the most frequent genetic lesions in both B-NHL and 
MM. Among NHL subtypes, translocation breakpoints at 
1q21-q23, including translocations and duplications, have 
been reported, often as the single chromosomal 
abnormality, in 17-20% of follicular and diffuse large 
B-cell lymphoma (DLCL), in 39% of marginal-zone B cell 
lymphoma (Offit et al., 1991; Whang-Peng et al., 1995; 
Cigudosa et al., 1999) and in 27-38% of Burkitt lymphoma, 
where they represent the second most common cytogenetic 
abnormality after translocations involving the MYC 
proto-oncogene (Berger and Bernheim, 1985; Kornblau et al., 
1991). Comparative genome hybridization (CGH) has also 
identified 1q21-q23 as a recurring site for high-level 
amplification in 10% of DLCL cases (Rao et al., 1998). In 
MM, trisomy of the 1q21-q32 region has been reported in 
20-31% of cases (Sawyer et al., 1995), amplification of 
the 1q12-qter region in 80% of cell lines and 40% of 
primary tumors (Avet-Loiseau et al., 1997), and nonrandom 
unbalanced whole-arm translocations of 1q, associated with 
the multiduplication of the adjacent 1q21-22 region, were 
found in 23% of patients with abnormal karyotypes (Sawyer 
et al., 1998). 

The high frequency of involvement of 1q21 structural 
rearrangements in B-cell malignancies suggests that this 
locus may harbor genes critical to the pathogenesis of 
these diseases. Cloning of a t(1;14)(q21;q32) in a pre-B 
cell acute lymphoblastic leukemia cell line previously 
identified a novel gene, BCL9, deregulated in this single 
case (Willis et al., 1998), but not involved in other 
cases. A recent report characterized the t(1;22)(q22;q11) 
in a follicular lymphoma (FL) cell line and found that the 
FCGR2B locus, encoding the low affinity IgG Fc receptor 
FCGRIIB, was targeted in this cell line and in two 
additional FL cases (Callanan et al., 2000). Finally, the 
MUC1 locus has been identified in proximity of the 
breakpoint of a t(1;14)(q21;q32) in NHL (Dyomin et al., 
2000; Gilles et al., 2000), and MUC1 locus rearrangements 
have been found in 6% of NHL with 1q21 abnormalities 
(Dyomin et al., 2000). These results highlight the 
heterogeneity of the 1q21 breakpoints and the need to 
identify additional candidate oncogenes situated in this 
locus, since the large majority of these alterations 
remain unexplained. 

The aim of this study was to further explore the 
architecture of 1q21 chromosomal rearrangements in B cell 
malignancy. To that end, we have employed a molecular 
cloning approach of the t(1;14)(q21;q32) present in the 
myeloma cell line FR4. We have identified two novel genes 
that are differentially targeted by 1q21 abnormalities. 
These genes code for five novel members of the 
immunoglobulin receptor family, IRTA1, IRTA2, IRTA3, IRTA4 
and IRTA5 (Immunoglobulin superfamily Receptor 
Translocation Associated genes 1, 2, 3, 4, and 5), which 
may be important for normal lymphocyte function and B cell 
malignancy. 
SUMMARY OF THE INVENTION 



This invention provides an isolated nucleic acid molecule 
which encodes an immunoglobulin receptor, Immunoglobulin 
superfamily Receptor Translocation Associated, IRTA, 
protein. 

This invention provides a method of producing an IRTA 
polypeptide (protein) which comprises: (a) introducing a 
vector comprising an isolated nucleic acid which encodes 
an immunoglobulin receptor, Immunoglobulin superfamily 
Receptor Translocation Associated, IRTA, protein into a 
suitable host cell; and (b) culturing the resulting cell 
so as to produce the polypeptide. 

This invention provides an isolated nucleic acid molecule 
comprising at least 15 contiguous nucleotides capable of 
specifically hybridizing with a unique sequence included 
within the sequence of the isolated nucleic acid molecule 
encoding IRTA protein. In an embodiment, the IRTA protein 
may be IRTA1, IRTA2, IRTA3, IRTA4 or IRTA5 protein, or 
fragment(s) thereof, having the amino acid sequence set 
forth in any of Figures 18A, 18B-1-18B-3, 18C-1-18C-2, 
18D-1-18D-2 or 18E-1-18E-2, respectively. 
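The notion of a probe of at least 15 contiguous nucleotides that hybridizes to a "unique sequence" can be illustrated computationally. The following is a minimal sketch only, assuming that exact 15-mer matching against a set of background sequences (both strands) is an acceptable stand-in for hybridization specificity; real probe design would also consider mismatches, melting temperature and secondary structure. The function names and the toy sequence are hypothetical and are not taken from this application.

```python
def reverse_complement(seq):
    """Return the DNA reverse complement (U in the input is read as T)."""
    return seq.upper().translate(str.maketrans("ACGTU", "TGCAA"))[::-1]


def kmers(seq, k=15):
    """Yield (0-based offset, k-mer) for every window of length k in seq."""
    seq = seq.upper()
    for i in range(len(seq) - k + 1):
        yield i, seq[i:i + k]


def unique_kmers(target, background, k=15):
    """Return the k-mers of target that occur in no background sequence,
    checking both strands of each background sequence (exact matches only)."""
    seen = set()
    for b in background:
        for strand in (b, reverse_complement(b)):
            seen.update(km for _, km in kmers(strand, k))
    return [(i, km) for i, km in kmers(target, k) if km not in seen]


# Toy, made-up 20-nt "target" and a background that shares its first 15 nt.
target = "ATGGCCTTAGCAAGTCGGAT"
candidates = unique_kmers(target, [target[:15]])
probes = [reverse_complement(km) for _, km in candidates]  # strands that would anneal
```

Each entry of `candidates` is a window of the target absent from the background; its reverse complement is the strand that would be used as a labeled probe.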

This invention provides a method for detecting a B cell 
malignancy or a type of B cell malignancy in a sample from 
a subject wherein the B cell malignancy comprises a 1q21 
chromosomal rearrangement which comprises: a) obtaining 
RNA from the sample from the subject; b) contacting the 
RNA of step (a) with a nucleic acid molecule of at least 
15 contiguous nucleotides capable of specifically 



WO 01/38490 



PCT/US00/32403 




hybridizing with a unique sequence included within the 
sequence of an isolated RNA encoding human IRTA protein 
selected from the group consisting of human IRTA1, IRTA2, 
IRTA3, IRTA4 and IRTA5, under conditions permitting 
hybridization of the RNA of step (a) with the nucleic acid 
molecule capable of specifically hybridizing with a unique 
sequence included within the sequence of an isolated RNA 
encoding human IRTA protein, wherein the nucleic acid 
molecule is labeled with a detectable marker; and c) 
detecting any hybridization in step (b), wherein detection 
of hybridization indicates presence of B cell malignancy 
or a type of B cell malignancy in the sample. 

This invention provides an antisense oligonucleotide 
having a sequence capable of specifically hybridizing to 
an mRNA molecule encoding a human IRTA protein so as to 
prevent overexpression of the mRNA molecule. 
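For illustration, an antisense oligonucleotide is the reverse complement of the mRNA region it is meant to hybridize to. The sketch below assumes a DNA oligonucleotide written 5' to 3', a hypothetical default length of 18 nucleotides, and a made-up transcript fragment; none of these values come from this application, and no chemistry-specific design rules are modeled.

```python
def antisense_oligo(mrna, start, length=18):
    """Return a DNA oligo (5'->3') complementary to mrna[start:start+length].

    mrna may be written with U or T; start is a 0-based offset.
    """
    region = mrna.upper().replace("T", "U")[start:start + length]
    if len(region) < length:
        raise ValueError("window runs past the end of the transcript")
    pairs = {"A": "T", "U": "A", "G": "C", "C": "G"}
    # Reverse the region so the oligo reads 5'->3', then complement each base.
    return "".join(pairs[base] for base in reversed(region))


# Made-up 12-nt transcript fragment, for demonstration only.
oligo = antisense_oligo("AUGGCCUUAGCA", start=0, length=6)
```

Targeting a window around the start codon is one common design choice, but the window position here is purely an assumption for the example.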

This invention provides a purified IRTA1 protein 
comprising the amino acid sequence set forth in Figure 18A 
(SEQ ID NO:1). 

This invention provides a purified IRTA2 protein 
comprising the amino acid sequence set forth in Figures 
18B-1-18B-3 (SEQ ID NO:3). 

This invention provides a purified IRTA3 protein 
comprising the amino acid sequence set forth in Figures 
18C-1-18C-2 (SEQ ID NO:5). 

This invention provides a purified IRTA4 protein 
comprising the amino acid sequence set forth in Figures 
18D-1-18D-2 (SEQ ID NO:7). 

This invention provides a purified IRTA5 protein 
comprising the amino acid sequence set forth in Figures 
18E-1-18E-2 (SEQ ID NO:9). 

This invention provides an antibody/antibodies directed to 
an epitope of a purified IRTA1, IRTA2, IRTA3, IRTA4 or 
IRTA5 protein, or fragment(s) thereof, having the amino 
acid sequence set forth in any of Figures 18A, 18B-1-18B-3, 
18C-1-18C-2, 18D-1-18D-2 or 18E-1-18E-2. 

This invention provides an antibody directed to a purified 
IRTA protein selected from the group consisting of IRTA1, 
IRTA2, IRTA3, IRTA4 and IRTA5. 

This invention provides a pharmaceutical composition 
comprising an amount of the antibody directed to an IRTA 
protein effective to bind to cancer cells expressing an 
IRTA protein selected from the group consisting of human 
IRTA1, IRTA2, IRTA3, IRTA4 and IRTA5 so as to prevent 
growth of the cancer cells and a pharmaceutically 
acceptable carrier. 

This invention provides a pharmaceutical composition 
comprising an amount of any of the oligonucleotides of 
nucleic acid molecules encoding IRTA proteins described 
herein effective to prevent overexpression of a human IRTA 
protein and a pharmaceutically acceptable carrier capable. 
This invention provides a method of diagnosing B cell 
malignancy which comprises a 1q21 chromosomal 
rearrangement in a sample from a subject which comprises: 
a) obtaining the sample from the subject; b) contacting 
the sample of step (a) with an antibody directed to a 
purified IRTA protein capable of specifically binding with 
a human IRTA protein selected from the group consisting of 
human IRTA1, IRTA2, IRTA3, IRTA4 and IRTA5 protein on 
a cell surface of a cancer cell under conditions 
permitting binding of the antibody with human IRTA protein 
on the cell surface of the cancer cell, wherein the 
antibody is labeled with a detectable marker; and c) 
detecting any binding in step (b), wherein detection of 
binding indicates a diagnosis of B cell malignancy in the 
sample. 

This invention provides a method of detecting human IRTA 
protein in a sample which comprises: a) contacting the 
sample with any of the above-described anti-IRTA 
antibodies under conditions permitting the formation of a 
complex between the antibody and the IRTA in the sample; 
and b) detecting the complex formed in step (a), thereby 
detecting the presence of human IRTA in the sample. 

This invention provides a method of treating a subject 
having a B cell cancer which comprises administering to 
the subject an amount of anti-IRTA antibody effective to 
bind to cancer cells expressing an IRTA protein so as to 
prevent growth of the cancer cells and a pharmaceutically 
acceptable carrier, thereby treating the subject. 
This invention provides a method of treating a subject 
having a B cell cancer which comprises administering to 
the subject an amount of an antisense oligonucleotide 
having a sequence capable of specifically hybridizing to 
an mRNA molecule encoding a human IRTA protein so as to 
prevent overexpression of the human IRTA protein, so as to 
arrest cell growth or induce cell death of cancer cells 
expressing IRTA protein(s) and a pharmaceutically 
acceptable carrier, thereby treating the subject. 

The invention also provides a pharmaceutical composition 
comprising either an effective amount of any of the 
oligonucleotides described herein and a pharmaceutically 
acceptable carrier. 

The invention also provides a pharmaceutical composition 
comprising either an effective amount of an antibody 
directed against an epitope of any IRTA protein described 
herein and a pharmaceutically acceptable carrier. 

BRIEF DESCRIPTION OF THE FIGURES 

Figures 1A-1B. Molecular cloning of the translocation 
t(1;14)(q21;q32) in the FR4 multiple 
myeloma cell line. Fig. 1A) Schematic 
representation of the λFR4B-5 and λFR4S-a 
clones, representing der(14) and der(1) 
breakpoints, and of the germline IgH and 
1q21 loci. Fig. 1B) Nucleotide sequence 
of the breakpoint junction and its 
alignment to the corresponding germline 
regions of chromosome 14. Sα, IgA switch 
region; LCR: 3' IgH locus control region; 
B, BamHI; H, HindIII; X, XhoI. 

Figures 2A-2B. Genomic map of the 1q21 locus in the 
vicinity of the FR4 breakpoint. Fig. 2A) 
Restriction endonuclease map and schematic 
representation of genomic clones, i.e. 
bacteriophages (1), P1 artificial 
chromosomes (PACs) (2), and yeast 
artificial chromosome (YAC) (3), spanning 
the germline 1q21 locus at the FR4 
breakpoint region (arrowhead). The name of 
each clone is placed directly on top of its 
representation. End fragments derived from 
the PAC and YAC inserts are depicted as 
circles, with either an SP6/T7 vector 
orientation (PAC), or left/right arm vector 
orientation (YAC). The top panel in Fig. 
1A depicts the genomic organization of two 
genes surrounding the FR4 breakpoint. The 
two genes were identified by exon trapping 
of PAC 49A16. They are closely spaced in 
the genome, within ≤ 30 Kb of each other 
and are named MUM2 and MUM3 (multiple 
myeloma-2 and 3). In the scheme of their 
genomic loci, black boxes indicate coding 
exons, whereas white and light or medium 
grey boxes indicate non-coding exons. 
Connecting introns are lines. MUM3 (left) 
gives rise to three alternatively spliced 
mRNAs, all sharing a common 5' untranslated 
region (UTR) but diverse 3' UTRs (marked by 
different shades). Numbers underneath the 
boxes identify the order of exons in the 
cDNA. Exons less than 100 bp are depicted 
as thin vertical lines. The position and 
size of each exon was determined by 
sequencing of genomic PAC and phage clones 
and by hybridization of cDNA probes to 
endonuclease-digested clone DNA. PAC and 
YAC mapping was performed by partial 
digestion with rare cutting enzymes 
followed by Pulse-Field-Gel-Electrophoresis 
and hybridization to internal and 
end-derived probes. Dashed lines align 
regions of overlap. S, SacI; H, HindIII; 
S, SwaI; Pc, PacI; P, PmeI; Fig. 2B) 
Genethon genetic linkage map of 1q21 in the 
region of the MUM2/MUM3 locus. 
Sequence-tagged sites (STS) are ordered in 
approximate distance previously determined 
by Dib, C., et al. (1996) Nature, 380:162-164. 
STS WI-5435 (in bold) is contained 
within YAC 23GC4 and PAC 49A16. Parallel 
vertical lines represent interrupted 
segments, whose approximate size is 
depicted above in megabases (MB). Sizing 
was estimated by the size of nonchimeric 
YAC contigs between two markers. The BCL9 
gene at the centromere was cloned from a 
different t(1;14)(q21;q32) breakpoint by 
Willis T.G. et al., (1998) Blood 91, 
6:1873-1881. The FcGRIIA gene is at the 
1q21-q22 chromosomal band border. 

Figures 3A-3C. MUM2 mRNA structure and expression pattern. 
Fig. 3A) Schematic representation of MUM2 
mRNA. Pattern-filled, wide boxes represent 
coding domains and narrow empty boxes 
represent untranslated regions. SP, signal 
peptide; EC, extracellular domain; TM, 
transmembrane domain; CYT, cytoplasmic 
domain; A(n), polyA tail. The 
extracellular region is composed of four 
immunoglobulin-like domains as depicted. 
Alternative polyadenylation signals 
(arrows) generate three MUM2 mRNA species 
(a, b, c) whose length (in Kb) ranges from 
2.6-3.5. Fig. 3B) Northern blot analysis 
of MUM2 mRNA expression in human tissues of 
the immune system. The cDNA probe used for 
the analysis is shown as a solid bar 
underneath the mRNA scheme in Fig. 3A. 
Each lane contains 2 µg mRNA of the 
corresponding tissue. On the right side of 
the blot, the position of RNA molecular 
weight markers is depicted. The position 
of MUM2 and GAPDH mRNA transcripts is shown 
by arrows. (A GAPDH probe was included in 
the hybridization as an internal control: 
0.15 ng labelled + 50 ng unlabelled probe.) 
The results of this analysis show weak 
expression of MUM2 in lymph node and 
spleen. MUM2 expression was not detected 
in a variety of other human tissues (data 
not shown). Fig. 3C) Northern blot 
analysis of MUM2 expression in total RNA 
from EREB, a conditional EBV-transformed B 
lymphoblastoid cell line. EREB carries the 
EBV genome with an EBNA2-estrogen receptor 
fusion protein, active only in the presence 
of estrogen. For this experiment, cells 
were grown in the presence of estrogen 
(1 µg/ml), followed by estrogen withdrawal 
for the indicated times. Upon estrogen 
withdrawal, EREB cells undergo G0/G1 
arrest, determined by the loss of c-myc 
expression. In Fig. 3C, a Northern blot of 
EREB total RNA (10 µg per lane) was 
hybridized with the MUM2 cDNA probe shown 
in Fig. 3A and the GAPDH internal control 
probe, as in Fig. 3B. Arrows indicate the 
position of the corresponding mRNAs on the 
EREB blot. a, b and c correspond to the 
MUM2 species in panel Fig. 3A. The same 
blot was then stripped and reprobed with a 
c-myc cDNA probe (exon 2) to verify 
cellular G0/G1 arrest. Quantitation of 
MUM2 mRNA by the use of a phosphorimager 
densitometric analysis demonstrates a 
10-fold increase in their levels within 48 hrs 
of estrogen withdrawal, suggesting that 
MUM2 expression is elevated as the cells 
enter a resting phase. 
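A fold increase of this kind is typically derived by normalizing each lane's target band to its loading control and then dividing by the baseline lane. The following is a minimal sketch of that arithmetic, assuming GAPDH is used as the loading control and the first lane is the baseline; whether the quantitation reported here was normalized in exactly this way is not stated. The intensity values are arbitrary placeholders, not data from this application.

```python
def fold_change(target, control, baseline_index=0):
    """Normalize target band intensities to a loading control, lane by lane,
    then express each lane relative to the baseline lane."""
    if len(target) != len(control):
        raise ValueError("target and control must have one value per lane")
    normalized = [t / c for t, c in zip(target, control)]
    baseline = normalized[baseline_index]
    return [n / baseline for n in normalized]


# Arbitrary placeholder intensities for two lanes (e.g. before and after withdrawal).
target_band = [100.0, 400.0]   # hypothetical target-transcript signal
gapdh_band = [50.0, 100.0]     # hypothetical GAPDH signal
ratios = fold_change(target_band, gapdh_band)
```

With these placeholder numbers the second lane carries twice the normalized signal of the first, even though its raw signal is four times higher, which is why the loading control matters.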



Figures 4A-4B. MUM3 mRNA structure and expression pattern. 
Fig. 4A) Schematic representation of MUM3 
mRNA. Pattern-filled, wide boxes represent 
coding domains and narrow empty or gray 
boxes represent untranslated regions. SP, 
signal peptide; EC, extracellular domain; 
TM, transmembrane domain; CYT, cytoplasmic 
domain; A(n), polyA tail. The 
extracellular region is composed of 
immunoglobulin-like domains, as depicted. 
Alternative splicing generates four mRNA 
species with diverse subcellular 
localization. MUM3-a and -d proteins are 
secreted, whereas MUM3-b contains a 
hydrophobic stretch of amino acids at its 
C-terminus which may serve as a signal for 
addition of a glycophosphatidyl-inositol 
anchor (GPI-anchor), as shown. MUM3-c 
spans the plasma membrane. Sequence 
identity among species is indicated by 
identical filling. Fig. 4B) Northern blot 
analysis of MUM3 mRNA expression in 
multiple human tissues (left) and in 
various lymphoid and non-lymphoid cell 
lines (right). The cDNA probe used is 
shown as a solid bar below the cDNA scheme 
in Fig. 4A. Each lane contains 2 µg mRNA of 
the corresponding tissue or cell line. The 
position of MUM3 and GAPDH mRNA transcripts 
is shown by arrows. (A GAPDH probe was 
included in the hybridization as an 
internal control as described in Fig. 3.) 
a, b, c and d correspond to the MUM3 mRNA 
species shown in Fig. 4A. RD, NC42 and 
CB33, Epstein-Barr virus transformed B 
lymphoblastoid cell lines; EREB, 
conditional EBV-transformed B 
lymphoblastoid cell line; FR4, plasma cell 
line; MOLT4 and HUT78, T cell lines; HL60 
and U937, myelomonocytic cell lines; K562, 
erythroid cell line. The results suggest 
that MUM3 is expressed solely in the immune 
system tissues of bone marrow, lymph and 
spleen and in particular in B cells with a 
lymphoblastoid phenotype. 

Figure 5. Nucleotide and amino acid sequence of human 
MUM2. The deduced amino acid sequence is 
shown above the nucleotide sequence in 
one-letter code and is numbered on the right, 
with position 1 set to the first codon of 
the signal peptide. The predicted signal 
peptidase site was derived by a computer 
algorithm described in Nielsen et al., 
Prot. Engineering 10, 1-6 (1997) and is 
marked by an arrowhead. The 
polyadenylation signal AATAAA is 
underlined. Potential sites for 
N-glycosylation are also underlined in the 
amino acid sequence. A hydrophobic stretch 
of 16 amino acids predicted to span the 
plasma membrane is doubly underlined. 
Consensus SH2-binding sites are highlighted 
by a wavy underline. 
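The annotated features in these sequence figures can be located with simple pattern scans. The sketch below assumes the widely used N-X-S/T sequon (X being any residue except proline) as the definition of a potential N-glycosylation site, and searches for the AATAAA and ATTAAA polyadenylation signals named in the figure descriptions. Whether the figures were annotated with exactly this rule is an assumption, and the input strings are made up for demonstration.

```python
import re

# Lookahead so that overlapping sequons (e.g. N-N-T-S) are all reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def n_glyc_sites(protein):
    """Return 1-based positions of Asn residues in N-X-S/T sequons (X != Pro)."""
    return [m.start() + 1 for m in SEQUON.finditer(protein.upper())]


def polya_signals(cdna, motifs=("AATAAA", "ATTAAA")):
    """Return sorted (1-based position, motif) pairs for each signal found."""
    seq = cdna.upper()
    hits = []
    for motif in motifs:
        for m in re.finditer(f"(?={motif})", seq):
            hits.append((m.start() + 1, motif))
    return sorted(hits)


# Made-up inputs, for demonstration only.
sites = n_glyc_sites("MNGSANPTNNTS")
signals = polya_signals("GGAATAAACCATTAAAGG")
```

In the toy peptide, the asparagine followed by proline is skipped, which is the one exclusion the sequon rule encodes.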

Nucleotide and amino acid sequence of human 
MUM3-a. The deduced amino acid sequence is 
shown above the nucleotide sequence in 
one-letter code and is numbered on the right, 
with position 1 set to the first codon of 
the signal peptide. The predicted site for 
signal peptidase cleavage was derived as 
previously stated above and is marked by an 
arrowhead. The polyadenylation signal 
ATTAAA is underlined. Potential sites for 
N-glycosylation are also underlined in the 
amino acid sequence. The protein lacks a 
transmembrane domain and is predicted to be 
secreted. 

Nucleotide and amino acid sequence of human 
MUM3-b. The deduced amino acid sequence is 
shown above the nucleotide sequence in 
one-letter code and is numbered on the right, 
with position 1 set to the first codon of 
the signal peptide. The predicted site for 
signal peptidase cleavage was derived as 
previously stated above and is marked by an 
arrowhead. The polyadenylation signal 
AATAAA is underlined. Potential sites for 
N-glycosylation are underlined in the amino 
acid sequence. 



Figure 6C-1-6C-2. Nucleotide and amino acid sequence of 
human MUM3-c. The deduced amino acid 
sequence is shown above the nucleotide 
sequence in one-letter code and is 
numbered on the right, with position 1 
set to the first codon of the signal 
peptide. The predicted site for 
signal peptidase cleavage was derived 
as previously stated above and is 
marked by an arrowhead. The 
polyadenylation signal AATAAA is 
underlined. Potential sites for 
N-glycosylation are underlined in the 
amino acid sequence. A hydrophobic 
stretch of 23 amino acids predicted to 
span the plasma membrane is doubly 
underlined. Consensus SH2-binding 
sites are highlighted by a wavy 
underline. 



Figures 7A-7C. t(1;14)(q21;q32) in FR4 generates a MUM2/Cα 
fusion transcript. Fig. 7A) Schematic 
representation of the der(14) genomic 
clone λFR4B-5 and of the germline IgHA1 
locus. The FR4 breakpoint is marked by an 
arrow. Filled and open boxes represent the 
MUM2 and Calpha coding and non-coding exons 
respectively. The position of the MUM2 
exon 1 probe used for Northern blot 
analysis is shown by a bar. Fig. 7B) 
Northern blot analysis with a MUM2 exon 1 
probe on FR4 and additional cell lines 
detects an abnormal message of 0.8 Kb, 
selectively in FR4. Arrowheads point to 
the location of normal MUM2 message in EREB 
mRNA. JJN3 and U266, myeloma cell lines; 
EREB, conditional EBV-transformed B 
lymphoblastoid cell line. Two µg of polyA+ 
RNA were loaded per lane. Fig. 7C) 
Nucleotide and amino acid sequence of the 
MUM2-Cα fusion cDNA in FR4. The cDNA was 
amplified by RT-PCR from FR4 total RNA 
using the primers shown in Fig. 7A, and was 
subsequently subcloned and sequenced. The 
deduced amino acid sequence is shown above 
the nucleotide sequence in one-letter code 
and is numbered on the right with position 
1 set to the first codon of the signal 
peptide. The predicted site for signal 
peptidase cleavage was derived as 
previously stated above and is marked by an 
arrowhead. The polyadenylation signal 
AATAAA is underlined. The Calpha 
transmembrane domain is underlined. The 
MUM2 portion of the cDNA is shown in 
italics. H, HindIII; B, BamHI; X, XhoI; 
Sα, IgA switch region; EC, extracellular 
region; TM, transmembrane; CYT, cytoplasmic 
domain. 



Figures 8A-8C. Molecular cloning of the translocation 
t(1;14)(q21;q32) in the FR4 multiple 
myeloma cell line. Fig. 8A) Schematic 
representation of the phage clones 
representing der(14) and der(1) breakpoints 
and the germline IGH and 1q21 loci. 
Chromosome 14 sequences are indicated by a 
solid black line with black boxes 
representing Cα1 exons. Chromosome 1 
sequences are shown as a grey line. The 
probes used for chromosomal mapping are 
indicated below the map. Restriction 
enzyme codes are: B, BamHI; H, HindIII; X, 
XhoI; S, SacI; E, EcoRI. For enzymes 
marked by (*) only sites delineating the 
probes are shown. Sα: IgA switch region; 
LCR: 3' IgH locus control region. Fig. 8B) 
Nucleotide sequence of the breakpoint 
junctions and their alignment to the 
corresponding germline regions of 
chromosomes 14 and 1. Fig. 8C) Left, 
fluorescence in situ hybridization (FISH) 
analysis on human normal metaphase spreads 
with the PAC clone 49A16 (Fig. 13) spanning 
the germline 1q21 region at the FR4 
breakpoint. Right, DAPI stained image from 
the same metaphase spread. 




Figures 9A-9B. Structure of IRTA1 and IRTA2 cDNAs. Figs. 
9A, 9B) Schematic representation of the 
full-length IRTA1 (Fig. 9A) and IRTA2 (Fig. 
9B) cDNAs. Pattern-filled, wide boxes 
represent coding domains and narrow boxes 
represent untranslated regions (UTR). The 
predicted site for signal peptidase 
cleavage is marked by an arrowhead and was 
derived according to the SignalP World 
Wide Web server at 
http://www.cbs.dtu.dk/services/SignalP. 
The transmembrane domain prediction 
algorithm is described in Tusnady et al., 
1998. SP, signal peptide; EC, 
extracellular domain; Ig, immunoglobulin-type; 
TM, transmembrane domain; CYT, 
cytoplasmic domain; A(n), polyA tail; GPI, 
glycophosphatidyl inositol. In (Fig. 9A), 
arrows in the 3' UTR indicate different 
polyadenylation addition sites utilized in 
the IRTA1 cDNA. In (Fig. 9B), different 
3' UTR regions in IRTA2 isoforms are 
differentially shaded. Bars underneath the 
UTR regions in (Fig. 9A) and (Fig. 9B) 
identify probes used for Northern blot 
analysis in Figure 12. 
Figures 10A-10B. Comparison of the amino acid sequences 
of IRTA1 and IRTA2 with members of the 
Fc Receptor family. Fig. 10A) Multiple 
sequence alignment of the first two 
(top) and the third (bottom) 
extracellular Ig-domains of IRTA1 and 
IRTA2 to Fc receptor family members. 
The sequences were compared using the 
ClustalW program (Thompson et al., 
1994). Black-shaded boxes indicate 
conserved amino acids among all 
sequences; dark-grey shaded boxes 
indicate conserved amino acids among at 
least half of the sequences; light-shaded 
boxes indicate conservative 
substitutions. Fig. 10B) Alignment of 
the SH2-binding domains of IRTA1 and 
IRTA2 with the ITAM and ITIM consensus 
motifs. Conserved amino acid positions 
are in bold. Symbol X represents any 
amino acid. 
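Consensus motifs of this kind, in which X stands for any amino acid, translate directly into regular expressions. The sketch below assumes the commonly cited forms (I/V/L/S)-X-Y-X-X-(L/V) for ITIM and Y-X-X-(L/I)-X(6-8)-Y-X-X-(L/I) for ITAM; the exact consensus drawn in Fig. 10B is not reproduced in the text, so these patterns, the function name, and the toy peptides are assumptions for illustration only.

```python
import re

# Capturing group inside a lookahead so overlapping candidates are reported.
ITIM = re.compile(r"(?=([IVLS].Y..[LV]))")
ITAM = re.compile(r"(?=(Y..[LI].{6,8}Y..[LI]))")


def find_motifs(protein):
    """Return {'ITIM': [...], 'ITAM': [...]} with (1-based start, matched text)."""
    seq = protein.upper()
    return {
        "ITIM": [(m.start() + 1, m.group(1)) for m in ITIM.finditer(seq)],
        "ITAM": [(m.start() + 1, m.group(1)) for m in ITAM.finditer(seq)],
    }


# Made-up peptides, for demonstration only.
itim_example = find_motifs("AAVTYAEVAAA")
itam_example = find_motifs("YAALAAAAAAAYAAI")
```

A scan like this only flags candidate SH2-binding sites; it says nothing about whether a given tyrosine is actually phosphorylated or functional.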


Figures 11A-11B-4. IRTA1 expression pattern. Fig. 11A) 
Left panel. Northern blot analysis of 
IRTA1 mRNA expression in tissues of 
the human immune system. Each lane 
contains 2 µg mRNA. The position of 
RNA molecular weight markers is 
depicted on the right side of the 
blot. The positions of the IRTA1 and 
GAPDH mRNA transcripts are shown by 
arrows. (A GAPDH probe was included 
in the hybridization as an internal 
control: 0.15 ng labelled + 50 ng 
unlabelled probe.) Right panel. 
Northern blot analysis of IRTA1 
expression in total RNA from the ER/EB 
cell line (10 µg per lane). For this 
experiment, cells were grown in the 
presence of estrogen (1 µg/ml), 
followed by estrogen withdrawal for 
the indicated times. Arrows indicate 
the positions of the corresponding 
mRNAs. a, b and c correspond to the 
IRTA1 differentially polyadenylated 
species. The same blot was stripped 
and reprobed with a MYC cDNA probe 
(exon 2) to verify cellular G0/G1 
arrest. Densitometric analysis of 
IRTA1 mRNA levels is plotted in the 
adjacent column graph. The cDNA probe 
used is shown as a solid bar 
underneath the IRTA1 mRNA scheme in 
Figure 9A. Fig. 11B-1-11B-4) In situ 
hybridization analysis of IRTA1 
expression in serial sections of human 
tonsil. 1. Sense IRTA1 probe. 2. 
Antisense IRTA1 probe. 3. H&E staining. 
4. Antisense IRTA1 signal superimposed 
over an H&E stained section. GC, 
germinal center; MargZ, marginal zone. 

Figure 12A-12B-4. IRTA2 expression pattern. Fig. 12A) 
Northern blot analysis of IRTA2 mRNA 
expression in multiple human tissues 
(left panel) and in various lymphoid 
and non-lymphoid cell lines (right 
panel). Each lane contains 2 µg mRNA. 
The positions of the IRTA2 and GAPDH 
transcripts are shown by arrows. a, 
b, c and d correspond to the 
alternatively spliced IRTA2 mRNA 
isoforms. RD, NC42 and CB33, 
Epstein-Barr virus transformed B 
lymphoblastoid cell lines; EREB, 
conditional EBV-transformed B 
lymphoblastoid cell line; FR4, plasma 
cell line; MOLT4 and HUT78, T cell 
lines; HL60 and U937, myelomonocytic 
cell lines; K562, erythroid cell line. 
The cDNA probe used is shown as a 
solid bar underneath the IRTA2 mRNA 
scheme in Figure 9B. Figs. 12B-1-12B-4) 
In situ hybridization analysis of 
IRTA2 mRNA expression in human tonsil. 
Fig. 12B-1. Sense IRTA2 cDNA probe, 
Fig. 12B-2. Antisense IRTA2 cDNA 
probe, Fig. 12B-3. H&E staining, Fig. 
12B-4. Antisense IRTA2 cDNA probe 
signal superimposed over H&E stained 
section. GC, germinal center; MargZ, 
marginal zone. 

Map of the germline 1q21 region spanning
the FR4 breakpoint and genomic organization
of IRTA1 and IRTA2. Primers used to
amplify IRTA1 exons from spleen cDNA are
marked by arrowheads on top panel. Black
and light boxes indicate coding and non-coding
exons respectively. Arrows indicate
position of BCL9, MUC1, IRTA family and
FCGRIIB loci. S, SacI; H, HindIII; S,
SwaI; Pc, PacI; P, PmeI; Mb, Megabases.

WO 01/38490 PCT/US00/32403



Figures 14A-14D. t(1;14)(q21;q32) in FR4 generates an

IRTA1/Cα fusion transcript. Fig. 14A)
Schematic representation of the
der(14) genomic clone 1FR4B-5 and of
the germline IgCα1 locus. The FR4
breakpoint is marked by an arrow.
Filled and open boxes represent the
IRTA1 and Cα1 coding and non-coding
exons respectively. Fig. 14B)
Northern blot analysis with an IRTA1
exon 1 probe (shown by a bar in Fig.
14A) on FR4 and additional cell lines
detects an abnormal message in FR4.
Arrowheads point to the location of
normal IRTA1 message in ER/EB mRNA.
JJN3 and U266, myeloma cell lines.
Two µg of polyA+ RNA loaded per lane.
Fig. 14C) Schematic representation of
the IRTA1/Cα fusion cDNA in FR4. The
cDNA was amplified by RT-PCR from FR4
total RNA using the primers shown in
Fig. 14A, and sequenced after
subcloning. Fig. 14D) SDS/PAGE
analysis of immunoprecipitates
obtained from vector control
transfected and IRTA1/Cα transient
expression construct transfected 293-T
cells (lanes 1 & 2), or the following
cell lines: mIgA positive
lymphoblastoid cell line Dakiki (lane
3), FR4 (lane 4), mIgM positive NHL
cell line Ramos (lane 5). H, HindIII;
B, BamHI; X, XhoI; Sα, IgA switch
region; EC, extracellular region; TM,
transmembrane; CYT, cytoplasmic.


IRTA2 expression is deregulated in
cell lines carrying 1q21
abnormalities. Figs. 15A, 15B)
Northern blot analysis of IRTA2 mRNA
expression in Burkitt lymphoma (Fig.
15A) and Multiple Myeloma (Fig. 15B)
cell lines. The cDNA probe used is
the same as in Fig. 12. Each lane
contains 2 µg mRNA. The positions of
the IRTA2 and GAPDH mRNA transcripts
are shown by dashes and arrows,
respectively. The relative levels of
IRTA2 mRNA expression in the left
panel (Fig. 15A) were plotted on the
right panel (Fig. 15A) after
densitometric analysis and
normalization versus the GAPDH levels.
The right panel of Fig. 15B is a
summary of the Northern blot analysis
results.



Figures 16-1-16-4. IRTA1 expression in normal lymphoid

tissue. Paraffin-embedded sections
from normal human tonsil were stained
with the following antibodies: Fig.
16-1) Negative control; Fig. 16-2)
anti-CD3 mouse monoclonal to detect T
cells; Fig. 16-3) anti-IRTA1 (mIRTA)
mouse monoclonal; Fig. 16-4) anti-IRTA1
(J92884K) rabbit polyclonal.
IRTA1 positive cells are located in
the perifollicular and intraepithelial
region of the tonsil, the equivalent
of the marginal zone in the spleen.

Figure 17. IRTA1 expression in a stomach Mucosa-Associated-Lymphoid

Tissue (MALT) B cell
lymphoma. A paraffin-embedded section from
a stomach MALT B cell lymphoma was stained
with the anti-IRTA1 (mIRTA) mouse
monoclonal antibody and counterstained with
H&E. The majority of MALT lymphomas
analyzed were IRTA1 positive. This
antibody therefore can be an effective tool
in the differential diagnosis of MALT
lymphoma. The mIRTA1 antibody may also
prove useful in the therapy of this B cell
tumor, similarly to the use of the anti-CD20
antibody (Rituximab) in the therapy of
relapsed CD20-positive lymphomas (Foon K.,
Cancer J. 6: p273).



Figure 18A. IRTA1 cDNA and the amino acid sequence of
the encoded IRTA1 protein.

Figures 18B-1-18B-3. IRTA2 cDNA and the amino acid
sequence of the encoded IRTA2 protein.

Figures 18C-1-18C-2. IRTA3 cDNA and the amino acid
sequence of the encoded IRTA3 protein.

Figures 18D-1-18D-2. IRTA4 cDNA and the amino acid
sequence of the encoded IRTA4 protein.

Figures 18E-1-18E-2. IRTA5 cDNA and the amino acid
sequence of the encoded IRTA5 protein.

DETAILED DESCRIPTION OF THE INVENTION

The following standard abbreviations are used throughout 
the specification to indicate specific nucleotides: 
C=cytosine; A=adenosine; T=thymidine and G=guanosine. 

This invention provides an isolated nucleic acid molecule
which encodes immunoglobulin receptor, Immunoglobulin
superfamily Receptor Translocation Associated, IRTA,
protein.

As used herein, "Immunoglobulin Receptor Translocation
Associated" genes, "IRTA", are nucleic acid molecules which
encode novel immunoglobulin superfamily cell surface
receptors in B cells which are important in B cell
development, and whose abnormal expression, e.g.
deregulated expression, perturbs cell surface B cell
immunological responses and thus is involved in B cell
malignancy, including lymphomagenesis.

Nucleic acid molecules encoding proteins designated "MUM-2"
and "MUM-3" proteins in the First Series of Experiments are
now called "IRTA-1" and "IRTA-2" genes, i.e. nucleic acid
molecules which encode IRTA-1 and IRTA-2 proteins
respectively. IRTA-3, -4 and -5 proteins are members of
the same immunoglobulin gene superfamily as are the
IRTA-1 and IRTA-2 proteins.

In an embodiment of the above-described isolated nucleic
acid molecule which encodes immunoglobulin receptor,
Immunoglobulin superfamily Receptor Translocation
Associated, IRTA, protein, the encoded IRTA protein is
IRTA1 protein comprising the amino acid sequence set forth
in Figure 18A (SEQ ID NO:1).

In another embodiment of the above-described isolated
nucleic acid molecule, the encoded IRTA protein is IRTA2
protein comprising the amino acid sequence set forth in
Figures 18B-1-18B-3 (SEQ ID NO:3).

In a further embodiment of the above-described isolated
nucleic acid molecule, the encoded IRTA protein is IRTA3
protein comprising the amino acid sequence set forth in
Figures 18C-1-18C-2 (SEQ ID NO:5).

In yet another embodiment of the above-described isolated
nucleic acid molecule, the encoded IRTA protein is IRTA4
protein comprising the amino acid sequence set forth in
Figures 18D-1-18D-2 (SEQ ID NO:7).

In a still further embodiment of the above-described
isolated nucleic acid molecule, the encoded IRTA protein
is IRTA5 protein comprising the amino acid sequence set
forth in Figures 18E-1-18E-2 (SEQ ID NO:9).

In another embodiment of any of the above-described
isolated nucleic acid molecules, the nucleic acid molecule
is DNA. In further embodiments, the DNA is cDNA. In
additional embodiments, the DNA is genomic DNA. In
another embodiment, the nucleic acid molecule is an RNA
molecule. In yet another embodiment, the DNA molecule is
cDNA having the nucleotide sequence set forth in Figure
18A (SEQ ID NO:2). In another embodiment, the DNA molecule
is cDNA having the nucleotide sequence set forth in Figure
18A (SEQ ID NO:4). In a further embodiment, the DNA
molecule is cDNA having the nucleotide sequence set forth
in Figure 18A (SEQ ID NO:6). In another embodiment, the
DNA molecule is cDNA having the nucleotide sequence set
forth in Figure 18A (SEQ ID NO:8). In an embodiment, the
DNA molecule is cDNA having the nucleotide sequence set
forth in Figure 18A (SEQ ID NO:10). In preferred
embodiments of the isolated nucleic acid molecule,
the nucleic acid molecules encode human IRTA1, IRTA2,
IRTA3, IRTA4 or IRTA5 protein. In additional embodiments,
the nucleic acid molecules encode mammalian IRTA1 protein.
The mammalian IRTA1 protein may be murine IRTA1 protein.
In another preferred embodiment, the isolated nucleic acid
molecules are operatively linked to a promoter of DNA
transcription. In yet another preferred embodiment of the
isolated nucleic acid molecule, the promoter comprises a
bacterial, yeast, insect, plant or mammalian promoter.

This invention provides a vector comprising any of the
above-described isolated nucleic acid molecules encoding
IRTA proteins, including but not limited to mammalian IRTA
proteins, of which human and murine are preferred.
In an embodiment, the vector is a plasmid.

This invention provides a host cell comprising the
above-described vector comprising any of the above-described
isolated nucleic acid molecules encoding IRTA proteins.
Preferably, the isolated nucleic acid molecules in such
vectors are operatively linked to a promoter of DNA
transcription. In another embodiment of the host cell,
the cell is selected from a group consisting of a
bacterial cell, a plant cell, an insect cell and a
mammalian cell.



This invention provides a method of producing an IRTA
polypeptide (protein) which comprises: (a) introducing a
vector comprising an isolated nucleic acid which encodes
an immunoglobulin receptor, Immunoglobulin superfamily
Receptor Translocation Associated, IRTA, protein, into a
suitable host cell; and (b) culturing the resulting cell
so as to produce the polypeptide. In further embodiments,
the IRTA protein produced by the above-described method
may be recovered and in a still further embodiment, may be
purified either wholly or partially. In an embodiment
the IRTA protein may be any of IRTA1, IRTA2, IRTA3, IRTA4,
or IRTA5 protein. In further embodiments, any of the IRTA
proteins may be mammalian proteins. In still further
embodiments, the mammalian proteins may be human or mouse
IRTA proteins.

IRTA genes (nucleic acid molecules encoding IRTA proteins
IRTA1, IRTA2, IRTA3, IRTA4 and IRTA5) are useful for the
production of the IRTA proteins encoded thereby. IRTA
proteins are useful for production of antibodies; such
antibodies are used as reagents for differential diagnosis
of lymphoma subtypes in hematopathology. Antibodies
directed against IRTA proteins and which bind specifically
to IRTA proteins also have therapeutic uses, i.e. to
specifically target tumor cells, and may be used and
administered similarly to "Rituximab" (an anti-CD20
antibody), which is an antibody approved by the FDA for
therapy of relapsed CD20-positive lymphomas (Foon K.,
Cancer J. 6(5):273). Anti-IRTA1, anti-IRTA2, anti-IRTA3,
anti-IRTA4 and anti-IRTA5 antibodies are also useful
markers for isolation of specific subsets of B cells in
research studies of normal and tumor B cell biology.
Moreover, anti-IRTA1, anti-IRTA2, anti-IRTA3, anti-IRTA4
and anti-IRTA5 antibodies are useful research reagents to
experimentally study the biology of signaling in normal
and tumor B cells.

Methods of introducing nucleic acid molecules into cells
are well known to those of skill in the art. Such methods
include, for example, the use of viral vectors and calcium
phosphate co-precipitation. Accordingly, nucleic acid
molecules encoding IRTA proteins IRTA1, IRTA2, IRTA3,
IRTA4 and IRTA5 may be introduced into cells for the
production of these IRTA proteins.

Numerous vectors for expressing the inventive proteins
IRTA1, IRTA2, IRTA3, IRTA4, and IRTA5 may be employed.
Such vectors, including plasmid vectors, cosmid vectors,
bacteriophage vectors and other viruses, are well known in
the art. For example, one class of vectors utilizes DNA
elements which are derived from animal viruses such as
bovine papilloma virus, polyoma virus, adenovirus,
vaccinia virus, baculovirus, retroviruses (RSV, MMTV or
MoMLV), Semliki Forest virus or SV40 virus. Additionally,
cells which have stably integrated the DNA into their
chromosomes may be selected by introducing one or more
markers which allow for the selection of transfected host
cells. The markers may provide, for example, prototrophy
to an auxotrophic host, biocide resistance or resistance
to heavy metals such as copper. The selectable marker
gene can be either directly linked to the DNA sequences to
be expressed, or introduced into the same cell by
cotransformation.

Regulatory elements required for expression include
promoter sequences to bind RNA polymerase and
translation initiation sequences for ribosome binding.
Additional elements may also be needed for optimal
synthesis of mRNA. These additional elements may include
splice signals, as well as enhancers and termination
signals. For example, a bacterial expression vector
includes a promoter such as the lac promoter and for
translation initiation the Shine-Dalgarno sequence and
the start codon AUG. Similarly, a eukaryotic expression
vector includes a heterologous or homologous promoter for
RNA polymerase II, a downstream polyadenylation signal,
the start codon AUG, and a termination codon for
detachment of the ribosome. Such vectors may be obtained
commercially or assembled from the sequences described by
methods well known in the art, for example the methods
described above for constructing vectors in general.

These vectors may be introduced into a suitable host cell 
to form a host vector system for producing the inventive 
proteins. Methods of making host vector systems are well 
known to those skilled in the art. 

Suitable host cells include, but are not limited to,
bacterial cells (including gram positive cells), yeast
cells, fungal cells, insect cells and animal cells.
Suitable animal cells include, but are not limited to, HeLa
cells, Cos cells, CV1 cells and various primary mammalian
cells. Numerous mammalian cells may be used as hosts,
including, but not limited to, mouse fibroblast
NIH-3T3 cells, CHO cells, HeLa cells, Ltk- cells and COS
cells. Mammalian cells may be transfected by methods well
known in the art such as calcium phosphate precipitation,
electroporation and microinjection.

This invention provides an isolated nucleic acid molecule
comprising at least 15 contiguous nucleotides capable of
specifically hybridizing with a unique sequence included
within the sequence of the isolated nucleic acid molecule
encoding IRTA protein. In an embodiment, the IRTA protein
may be IRTA1, IRTA2, IRTA3, IRTA4 or IRTA5 protein, or
fragment(s) thereof, having the amino acid sequence set
forth in any of Figures 18A, 18B-1-18B-3, 18C-1-18C-2,
18D-1-18D-2 or 18E-1-18E-2, respectively. In other
embodiments, the isolated nucleic acid molecules are
labeled with a detectable marker. In still other
embodiments of the isolated nucleic acid molecules, the
detectable marker is selected from the group consisting of
a radioactive isotope, enzyme, dye, biotin, a fluorescent
label or a chemiluminescent label.
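
By way of illustration only, the notion of a stretch of at least 15 contiguous nucleotides that is unique to one sequence can be sketched computationally. The following is a minimal sketch, assuming exact string matching on one strand only; the function name `unique_windows` and the toy sequences are hypothetical and are not taken from Figures 18A-18E. Actual probe specificity also depends on the complementary strand, mismatch tolerance and hybridization conditions, none of which are modeled here.

```python
def unique_windows(target, others, k=15):
    """Return (position, window) pairs for every k-nucleotide window of
    `target` that does not occur verbatim in any sequence in `others`."""
    target = target.upper()
    # Collect every k-mer present in the comparison sequences.
    background = set()
    for seq in others:
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            background.add(seq[i:i + k])
    # Keep only the target windows absent from that background set.
    return [
        (i, target[i:i + k])
        for i in range(len(target) - k + 1)
        if target[i:i + k] not in background
    ]


# Toy sequences (hypothetical, for illustration only).
toy_target = "ATGGCCTTAGCGTACGATCGGATCC"
toy_related = toy_target[:20] + "AAAAA"  # shares its first 20 nucleotides

for position, window in unique_windows(toy_target, [toy_related]):
    print(position, window)
```

Windows lying entirely within the shared first 20 nucleotides are excluded, so only windows that reach into the divergent end of the toy target are reported as candidate unique sequences.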

This invention provides a method for detecting a B cell
malignancy or a type of B cell malignancy in a sample from
a subject wherein the B cell malignancy comprises a 1q21
chromosomal rearrangement which comprises: a) obtaining
RNA from the sample from the subject; b) contacting the
RNA of step (a) with a nucleic acid molecule of at least
15 contiguous nucleotides capable of specifically
hybridizing with a unique sequence included within the
sequence of an isolated RNA encoding human IRTA protein
selected from the group consisting of human IRTA1, IRTA2,
IRTA3, IRTA4 and IRTA5, under conditions permitting
hybridization of the RNA of step (a) with the nucleic acid
molecule capable of specifically hybridizing with a unique
sequence included within the sequence of an isolated RNA
encoding human IRTA protein, wherein the nucleic acid
molecule is labeled with a detectable marker; and c)
detecting any hybridization in step (b), wherein detection
of hybridization indicates presence of B cell malignancy
or a type of B cell malignancy in the sample.

Detection of hybridization of RNA encoding IRTA proteins
will indicate that a malignancy is a B cell malignancy.
More specifically, detection of hybridization of RNA
encoding IRTA1 protein indicates that the B cell
malignancy is a Mucosa-Associated-Lymphoid Tissue (MALT) B
cell lymphoma. Detection of hybridization of RNA encoding
IRTA4 and IRTA5 proteins indicates that the B cell
malignancy is a mantle cell lymphoma. In an embodiment of
the above-described method, the B cell malignancy
comprises a 1q21 chromosomal rearrangement. One of skill
will use the above-described method as a diagnostic aid in
conjunction with other standard methods of
detecting/diagnosing malignancies, e.g. pathology of a
tumor sample, which may indicate lymphoma, and the
above-described method will then narrow the malignancy to a B
cell lymphoma or more specifically to a MALT B cell
lymphoma or a mantle cell lymphoma as discussed supra.
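
The narrowing logic stated in the preceding paragraph can be written out as a simple lookup. This is a sketch only, assuming that the phrase "IRTA4 and IRTA5" means both transcripts are detected; the function name `indications` is hypothetical, and the output restates the text above rather than constituting a diagnostic rule on its own.

```python
IRTA_NAMES = {"IRTA1", "IRTA2", "IRTA3", "IRTA4", "IRTA5"}


def indications(detected):
    """Map the set of IRTA transcripts showing hybridization to the
    indications named in the text. Returns a list of strings."""
    found = {name.upper() for name in detected} & IRTA_NAMES
    result = []
    if not found:
        return result  # no IRTA hybridization: nothing is indicated
    result.append("B cell malignancy")
    if "IRTA1" in found:
        result.append("MALT B cell lymphoma")
    # Assumption: both IRTA4 and IRTA5 must be detected together.
    if {"IRTA4", "IRTA5"} <= found:
        result.append("mantle cell lymphoma")
    return result


print(indications({"IRTA1"}))
print(indications({"IRTA4", "IRTA5"}))
```

As the text notes, such a result is used as an aid alongside other standard methods, e.g. pathology of a tumor sample.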

One of skill is familiar with known methods of detecting
hybridization of nucleic acid molecules to
oligonucleotides, i.e. nucleic acid probes encoding a
protein of interest for diagnostic methods. The nucleic
acid molecules encoding the IRTA proteins of the subject
invention are useful for detecting B cell malignancy. One
of skill will recognize that variations of the
above-described method for detecting a B cell malignancy in a
sample include, but are not limited to, digesting nucleic
acid from the sample with restriction enzymes and
separating the nucleic acid molecule fragments so
obtained by size fractionation before hybridization.

In an embodiment of the above-described method for
detecting a B cell malignancy in a sample from a subject,
the detectable marker is a radioactive isotope,
enzyme, dye, biotin, a fluorescent label or a
chemiluminescent label. In a preferred embodiment, the B
cell malignancy is selected from the group consisting of B
cell lymphoma, multiple myeloma, Burkitt's lymphoma,
marginal zone lymphoma, diffuse large cell lymphoma and
follicular lymphoma. In a further embodiment, the B
cell lymphoma is Mucosa-Associated-Lymphoid Tissue B cell
lymphoma (MALT). In another preferred embodiment, the B
cell lymphoma is non-Hodgkin's lymphoma.

This invention provides an antisense oligonucleotide
having a sequence capable of specifically hybridizing to
an mRNA molecule encoding a human IRTA protein so as to
prevent overexpression of the mRNA molecule.
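
For illustration, an oligonucleotide that is fully complementary to a stretch of mRNA is the reverse complement of that stretch. The following minimal sketch derives such a DNA sequence, written 5' to 3'; the function name `antisense_dna` and the six-nucleotide example are hypothetical and are not drawn from the IRTA sequences. It shows base pairing only and says nothing about oligonucleotide length, chemistry or efficacy.

```python
# Watson-Crick partners, giving a DNA oligonucleotide for an RNA or DNA input.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "U": "A", "T": "A"}


def antisense_dna(mrna_segment):
    """Return the DNA reverse complement (5'->3') of an mRNA segment
    that is itself written 5'->3'."""
    seq = mrna_segment.upper()
    # Reverse the segment, then swap each base for its pairing partner.
    return "".join(COMPLEMENT[base] for base in reversed(seq))


# Hypothetical target segment, for illustration only.
print(antisense_dna("AUGGCC"))  # prints GGCCAT
```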

In preferred embodiments of the antisense oligonucleotide,
the IRTA protein is selected from the group consisting of
human IRTA1, IRTA2, IRTA3, IRTA4 and IRTA5 protein. In
further embodiments of any of the above-described
oligonucleotides of nucleic acid molecules encoding the
IRTA1, IRTA2, IRTA3, IRTA4 and/or IRTA5 proteins, the
nucleic acid may be genomic DNA or cDNA.

One of skill is familiar with conventional techniques for 
nucleic acid hybridization of oligonucleotides, e.g. 
Ausubel, F.M. et al. Current Protocols in Molecular 
Biology, (John Wiley & Sons, New York, 1998), for example 
stringent conditions of 65°C in the presence of an elevated 
salt concentration. Such conditions are used for 
completely complementary nucleic acid hybridization, 
whereas conditions that are not stringent are used for 
hybridization of nucleic acids which are not totally 
complementary. 

As used herein, the phrase "specifically hybridizing"
means the ability of a nucleic acid molecule to recognize
a nucleic acid sequence complementary to its own and to
form double-helical segments through hydrogen bonding
between complementary base pairs. As used herein,
"unique sequence" is a sequence specific to only the
nucleic acid molecules encoding the IRTA1, IRTA2, IRTA3,
IRTA4 and IRTA5 proteins. Nucleic acid probe technology
is well known to those skilled in the art who will readily
appreciate that such probes may vary greatly in length and
may be labeled with a detectable label, such as a
radioisotope or fluorescent dye, to facilitate detection
of the probe. Detection of nucleic acid molecules
encoding the IRTA1, IRTA2, IRTA3, IRTA4 and/or IRTA5
proteins is useful as a diagnostic test for any disease
process in which levels of expression of the corresponding
IRTA1, IRTA2, IRTA3, IRTA4 and/or IRTA5 proteins are
altered. DNA probe molecules are produced by insertion of
a DNA molecule which encodes mammalian IRTA1, IRTA2,
IRTA3, IRTA4 and/or IRTA5 proteins or fragments thereof
into suitable vectors, such as plasmids or bacteriophages,
followed by insertion into suitable bacterial host cells
and replication and harvesting of the DNA probes, all
using methods well known in the art. For example, the DNA
may be extracted from a cell lysate using phenol and
ethanol, digested with restriction enzymes corresponding
to the insertion sites of the DNA into the vector
(discussed herein), electrophoresed, and cut out of the
resulting gel. The oligonucleotide probes are useful for
in situ hybridization or in order to locate tissues
which express this IRTA gene family, and for other
hybridization assays for the presence of these genes
(nucleic acid molecules encoding any of the IRTA1-IRTA5
proteins) or their mRNA in various biological tissues. In
addition, synthesized oligonucleotides (produced by a DNA
synthesizer) complementary to the sequence of a DNA
molecule which encodes an IRTA1, IRTA2, IRTA3, IRTA4 or
IRTA5 protein are useful as probes for these genes, for
their associated mRNA, or for the isolation of related
genes by homology screening of genomic or cDNA libraries,
or by the use of amplification techniques such as the
Polymerase Chain Reaction.

This invention provides a purified IRTA1 protein
comprising the amino acid sequence set forth in Figure 18A
(SEQ ID NO:1). In an embodiment of the purified IRTA1
protein, the IRTA1 protein is human IRTA1.

This invention provides a purified IRTA2 protein
comprising the amino acid sequence set forth in Figures
18B-1-18B-3 (SEQ ID NO:3). In an embodiment of the
purified IRTA2 protein, the IRTA2 protein is human IRTA2.

This invention provides a purified IRTA3 protein
comprising the amino acid sequence set forth in Figures
18C-1-18C-2 (SEQ ID NO:5). In an embodiment of the
purified IRTA3 protein, the IRTA3 protein is human IRTA3.

This invention provides a purified IRTA4 protein
comprising the amino acid sequence set forth in Figures
18D-1-18D-2 (SEQ ID NO:7). In an embodiment of the
purified IRTA4 protein, the IRTA4 protein is human
IRTA4.

This invention provides a purified IRTA5 protein
comprising the amino acid sequence set forth in Figures
18E-1-18E-2 (SEQ ID NO:9). In an embodiment of the
purified IRTA5 protein, the IRTA5 protein is human IRTA5.

In order to facilitate an understanding of the
Experimental Details section which follows, certain
frequently occurring methods and/or terms are best
described in Sambrook, et al. (1989) and Harlow & Lane,
Antibodies: A Laboratory Manual, Cold Spring Harbor
Laboratories, Cold Spring Harbor, NY: 1988.

This invention provides an antibody/antibodies directed to
an epitope of a purified IRTA1, IRTA2, IRTA3, IRTA4 or
IRTA5 protein, or fragment(s) thereof, having the amino
acid sequence set forth in any of Figures 18A, 18B-1-18B-3,
18C-1-18C-2, 18D-1-18D-2 or 18E-1-18E-2.

As used herein, the term "antibody" includes, but is not
limited to, both naturally occurring and non-naturally
occurring antibodies. Specifically, the term "antibody"
includes polyclonal and monoclonal antibodies, and binding
fragments thereof. Furthermore, the term "antibody"
includes chimeric antibodies and wholly synthetic
antibodies, and fragments thereof. The polyclonal and
monoclonal antibodies may be "purified" which means the
polyclonal and monoclonal antibodies are free of any other
antibodies. As used herein, partially purified antibody
means an antibody composition which comprises antibodies
which specifically bind to any of the IRTA protein(s) of
the subject invention, and contains fewer protein
impurities than does the serum from which the antibodies
are derived. A protein impurity is a protein other than
the antibodies specific for the IRTA protein(s) of the
subject invention. For example, the partially purified
antibodies may be an IgG preparation.

Polyclonal antibodies (anti-IRTA antibodies) may be
produced by injecting a host animal such as rabbit, rat,
goat, mouse or other animal with the immunogen(s) of this
invention, e.g. a purified human IRTA1, IRTA2, IRTA3,
IRTA4 or IRTA5, described infra. The sera are extracted
from the host animal and are screened to obtain polyclonal
antibodies which are specific to the immunogen. Methods
of screening for polyclonal antibodies are well known to
those of ordinary skill in the art such as those disclosed
in Harlow & Lane, Antibodies: A Laboratory Manual (Cold
Spring Harbor Laboratories, Cold Spring Harbor, NY: 1988),
the contents of which are hereby incorporated by
reference.

The anti-IRTA monoclonal antibodies of the subject
invention may be produced by immunizing, for example, mice
with an immunogen (the IRTA polypeptides or fragments
thereof as described herein). The mice are inoculated
intraperitoneally with an immunogenic amount of the
above-described immunogen and then boosted with similar amounts
of the immunogen. Spleens are collected from the
immunized mice a few days after the final boost and a cell
suspension is prepared from the spleens for use in the
fusion.

Hybridomas may be prepared from the splenocytes and a
murine tumor partner using the general somatic cell
hybridization technique of Kohler, G. and Milstein, C.,
Nature (1975) 256: 495-497. Available murine myeloma
lines, such as those from the American Type Culture
Collection (ATCC), 10801 University Boulevard, Manassas,
VA 20110-2209, USA, may be used in the hybridization.
Basically, the technique involves fusing the tumor cells
and splenocytes using a fusogen such as polyethylene
glycol. After the fusion the cells are separated from the
fusion medium and grown in a selective growth medium, such
as HAT medium, to eliminate unhybridized parent cells.
The hybridomas may be expanded, if desired, and
supernatants may be assayed by conventional immunoassay
procedures, for example radioimmunoassay, using the
immunizing agent as antigen. Positive clones may be
characterized further to determine whether they meet the
criteria of the invention antibodies.
Hybridomas that produce such antibodies may be grown in
vitro or in vivo using known procedures. The monoclonal
antibodies may be isolated from the culture media or body
fluids, as the case may be, by conventional immunoglobulin
purification procedures such as ammonium sulfate
precipitation, gel electrophoresis, dialysis,
chromatography, and ultrafiltration, if desired.

In the practice of the subject invention, any of the
above-described antibodies may be labeled with a detectable
marker. In one embodiment, the labeled antibody is a
purified labeled antibody. The term "antibody" includes,
by way of example, both naturally occurring and
non-naturally occurring antibodies. Specifically, the term
"antibody" includes polyclonal and monoclonal antibodies,
and fragments thereof. Furthermore, the term "antibody"
includes chimeric antibodies and wholly synthetic
antibodies, and fragments thereof. "Detectable moieties"
which function as detectable labels are well known to
those of ordinary skill in the art and include, but are
not limited to, a fluorescent label, a radioactive atom, a
paramagnetic ion, biotin, a chemiluminescent label or a
label which may be detected through a secondary enzymatic
or binding step. The secondary enzymatic or binding step
may comprise the use of digoxigenin, alkaline phosphatase,
horseradish peroxidase, β-galactosidase, fluorescein or
streptavidin/biotin. Methods of labeling antibodies are
well known in the art.

Methods of recovering serum from a subject are well known
to those skilled in the art. Methods of partially
purifying antibodies are also well known to those skilled
in the art, and include, by way of example, filtration,
ion exchange chromatography, and precipitation.

The polyclonal and monoclonal antibodies of the invention
may be labeled with a detectable marker. In one
embodiment, the labeled antibody is a purified labeled
antibody. The detectable marker may be, for example, a
radioactive or fluorescent marker. Methods of labeling
antibodies are well known in the art.

Determining whether the polyclonal and monoclonal
antibodies of the subject invention bind to cells, e.g.
cancer cells, expressing an IRTA protein and form a
complex with one or more of the IRTA protein(s) described
herein, or fragments thereof, on the surface of said
cells, may be accomplished according to methods well known
to those skilled in the art. In the preferred embodiment,
the determining is accomplished according to flow
cytometry methods.

The antibodies of the subject invention may be bound to an
insoluble matrix such as that used in affinity
chromatography. Cells which form a complex, i.e. bind,
with the immobilized polyclonal or monoclonal antibody may
be isolated by standard methods well known to those
skilled in the art. For example, isolation may comprise
affinity chromatography using immobilized antibody.

Alternatively, the antibody may be a free antibody. In
this case, isolation may comprise cell sorting using free,
labeled primary or secondary antibodies. Such cell sorting
methods are standard and are well known to those skilled
in the art.



This invention provides an antibody directed to a purified
IRTA protein selected from the group consisting of IRTA1,
IRTA2, IRTA3, IRTA4 and IRTA5. In a preferred embodiment
of the anti-IRTA antibody the IRTA protein is human IRTA
protein. The IRTA protein may be any mammalian IRTA
protein, including a murine IRTA protein. In a further
embodiment of any of the above-described antibodies, the
antibody is a monoclonal antibody. In another embodiment,
the monoclonal antibody is a murine monoclonal antibody or
a humanized monoclonal antibody. As used herein,
"humanized" means an antibody having characteristics of a
human antibody, such antibody being non-naturally
occurring, but created using hybridoma techniques wherein
the antibody is of human origin except for the antigen
determinant portion, which is murine. In yet another
embodiment, the antibody is a polyclonal antibody.

In preferred embodiments, any of the antibodies of the 
subject invention may be conjugated to a therapeutic 
agent. In further preferred embodiments, the therapeutic 
agent is a radioisotope, toxin, toxoid, or 
chemotherapeutic agent. The conjugated antibodies of the 
subject invention may be administered to a subject having 
a B cell cancer in any of the methods provided below. 



This invention provides a pharmaceutical composition 
comprising an amount of the antibody directed to an IRTA 
protein effective to bind to cancer cells expressing an 
IRTA protein selected from the group consisting of human 
IRTA1, IRTA2, IRTA3, IRTA4 and IRTA5 so as to prevent 
growth of the cancer cells and a pharmaceutically 
acceptable carrier. The anti-IRTA antibody may be directed 
to an epitope of an IRTA protein selected from the group 
consisting of IRTA1, IRTA2, IRTA3, IRTA4 and IRTA5. The 
IRTA proteins may be human or mouse IRTA proteins. 

In preferred embodiments of the above-described 
pharmaceutical composition, the cancer cells are selected 
from the group consisting of B cell lymphoma, multiple 
myeloma, mantle cell lymphoma, Burkitt's lymphoma, 
marginal zone lymphoma, diffuse large cell lymphoma and 
follicular lymphoma cells. In another preferred 
embodiment of the pharmaceutical composition, the B cell 
lymphoma cells are Mucosa-Associated-Lymphoid Tissue B 
cell lymphoma (MALT) cells. In another preferred 
embodiment of the pharmaceutical composition, the B cell 
lymphoma cells are non-Hodgkin's lymphoma cells. 

This invention provides a pharmaceutical composition 
comprising an amount of any of the above-described 
oligonucleotides effective to prevent overexpression of a 
human IRTA protein and a pharmaceutically acceptable 
carrier capable. In preferred embodiments of the 
pharmaceutical composition the oligonucleotide is a 
nucleic acid molecule which encodes an IRTA protein 
selected from the group consisting of IRTA1, IRTA2, IRTA3, 
IRTA4 and IRTA5. The IRTA proteins may be human or mouse 
IRTA proteins. 

As used herein, "malignant" means capable of 
metastasizing. As used herein, "tumor cells" are cells 
which originate from a tumor, i.e., from a new growth of 
different or abnormal tissue. The tumor cells and cancer 
cells may exist as part of the tumor mass, or may exist as 
free-floating cells detached from the tumor mass from 
which they originate. 

As used herein, malignant cells include, but are in no way 
limited to, B cell lymphoma, multiple myeloma, Burkitt's 
lymphoma, mantle cell lymphoma, marginal zone lymphoma, 
diffuse large cell lymphoma and follicular lymphoma. The 
B cell lymphoma is Mucosa-Associated-Lymphoid Tissue B 
cell lymphoma (MALT) or is non-Hodgkin's lymphoma. 

As used herein, "subject" is any animal or artificially 
modified animal. Artificially modified animals include, 
but are not limited to, SCID mice with human immune 
systems. In a preferred embodiment, the subject is a 
human. 

This invention provides a method of diagnosing B cell 
malignancy which comprises a 1q21 chromosomal 
rearrangement in a sample from a subject which comprises: 
a) obtaining the sample from the subject; b) contacting 
the sample of step (a) with an antibody directed to a 
purified IRTA protein capable of specifically binding with 
a human IRTA protein selected from the group consisting of 
human IRTA1, IRTA2, IRTA3, IRTA4 and IRTA5 protein on 
a cell surface of a cancer cell under conditions 
permitting binding of the antibody with human IRTA protein 
on the cell surface of the cancer cell, wherein the 
antibody is labeled with a detectable marker; and c) 
detecting any binding in step (b), wherein detection of 
binding indicates a diagnosis of B cell malignancy in the 
sample. 

In an embodiment of the above-described method of 
diagnosing B cell malignancy, the IRTA protein is selected 
from the group consisting of IRTA1, IRTA2, IRTA3, IRTA4 
and IRTA5. In another embodiment of the method the IRTA 
protein is human or mouse IRTA protein. In a further 
embodiment the IRTA protein is purified. In a preferred 
embodiment of this method, the B cell malignancy is 
selected from the group consisting of B cell lymphoma, 
multiple myeloma, Burkitt's lymphoma, marginal zone 
lymphoma, diffuse large cell lymphoma and follicular 
lymphoma. In yet another embodiment of this method, the B 
cell lymphoma is Mucosa-Associated-Lymphoid Tissue B cell 
lymphoma (MALT). In another preferred embodiment of this 
method, the B cell lymphoma is non-Hodgkin's lymphoma. 

This invention provides a method of detecting human IRTA 
protein in a sample which comprises: a) contacting the 
sample with any of the above-described anti-IRTA 
antibodies under conditions permitting the formation of a 
complex between the antibody and the IRTA in the sample; 
and b) detecting the complex formed in step (a), thereby 
detecting the presence of human IRTA in the sample. In an 
embodiment the IRTA protein detected may be an IRTA1, 
IRTA2, IRTA3, IRTA4 or IRTA5 protein, having an amino acid 
sequence set forth in any of Figures 18A, 18B-1-18B-3, 
18C-1-18C-2, 18D-1-18D-2 or 18E-1-18E-2. As described 
hereinabove, detection of the complex formed may be 
achieved by using antibody labeled with a detectable 
marker and determining presence of labeled complex. 
Detecting human IRTA protein in a sample from a subject is 
another method of diagnosing B cell malignancy in a 
subject. In an embodiment of this method of diagnosis, the 
B cell malignancy is selected from the group consisting of 
B cell lymphoma, multiple myeloma, Burkitt's lymphoma, 
marginal zone lymphoma, diffuse large cell lymphoma and 
follicular lymphoma. In yet another embodiment of this 
method, the B cell lymphoma is Mucosa-Associated-Lymphoid 
Tissue B cell lymphoma (MALT). In another preferred 
embodiment of this method, the B cell lymphoma is 
non-Hodgkin's lymphoma. 



This invention provides a method of treating a subject 
having a B cell cancer which comprises administering to 
the subject an amount of anti-IRTA antibody effective to 
bind to cancer cells expressing an IRTA protein so as to 
prevent growth of the cancer cells and a pharmaceutically 
acceptable carrier, thereby treating the subject. Growth 
and proliferation of the cancer cells is thereby inhibited 
and the cancer cells die. In an embodiment of the 
above-described method, the IRTA protein is selected from the 
group consisting of human IRTA1, IRTA2, IRTA3, IRTA4 and 
IRTA5. In a preferred embodiment of the above-described 
method of treating a subject having a B cell cancer, the 
anti-IRTA antibody is a monoclonal antibody. In another 
embodiment of the method, the monoclonal antibody is a 
murine monoclonal antibody or a humanized monoclonal 
antibody. The antibody may be a chimeric antibody. In a 
further embodiment, the anti-IRTA antibody is a 
polyclonal antibody. In an embodiment, the polyclonal 
antibody may be a murine or human polyclonal antibody. In 
a preferred embodiment, the B cell cancer is selected from 
the group consisting of B cell lymphoma, multiple myeloma, 
Burkitt's lymphoma, mantle cell lymphoma, marginal zone 
lymphoma, diffuse large cell lymphoma and follicular 
lymphoma. In another preferred embodiment, the B cell 
lymphoma is Mucosa-Associated-Lymphoid Tissue B cell 
lymphoma (MALT). In a further preferred embodiment, the B 
cell lymphoma is non-Hodgkin's lymphoma. In a preferred 
embodiment of the above-described method of treating a 
subject having a B cell cancer, administration of the 
amount of anti-IRTA antibody effective to bind to cancer 
cells expressing an IRTA protein is intravenous, 
intraperitoneal, intrathecal, intralymphatical, 
intramuscular, intralesional, parenteral, epidural, 
subcutaneous; by infusion, liposome-mediated delivery, 
aerosol delivery; topical, oral, nasal, anal, ocular or 
otic delivery. In another preferred embodiment of the 
above-described methods, the anti-IRTA antibody may be 
conjugated to a therapeutic agent. In further preferred 
embodiments, the therapeutic agent is a radioisotope, 
toxin, toxoid, or chemotherapeutic agent. 

This invention provides a method of treating a subject 
having a B cell cancer which comprises administering to 
the subject an amount of an antisense oligonucleotide 
having a sequence capable of specifically hybridizing to 
an mRNA molecule encoding a human IRTA protein so as to 
prevent overexpression of the human IRTA protein, so as 
to arrest cell growth or induce cell death of cancer cells 
expressing IRTA protein(s) and a pharmaceutically 
acceptable carrier, thereby treating the subject. 
In an embodiment of the above-described method of treating 
a subject having a B cell cancer, the IRTA protein is 
selected from the group consisting of human IRTA1, IRTA2, 
IRTA3, IRTA4 and IRTA5 protein. In a preferred 
embodiment, the B cell cancer is selected from the group 
consisting of B cell lymphoma, multiple myeloma, Burkitt's 
lymphoma, marginal zone lymphoma, diffuse large cell 
lymphoma and follicular lymphoma. In another preferred 
embodiment, the B cell lymphoma is Mucosa-Associated-Lymphoid 
Tissue B cell lymphoma (MALT). In yet another 
preferred embodiment, the B cell lymphoma is non-Hodgkin's 
lymphoma. In embodiments of any of the above-described 
oligonucleotides of nucleic acid molecules encoding the 
IRTA1, IRTA2, IRTA3, IRTA4 and/or IRTA5 proteins, the 
nucleic acid may be genomic DNA or cDNA. In a further 
preferred embodiment of the above-described method of 
treating a subject having a B cell cancer, administration 
of the amount of oligonucleotide effective to prevent 
overexpression of a human IRTA protein is intravenous, 
intraperitoneal, intrathecal, intralymphatical, 
intramuscular, intralesional, parenteral, epidural, 
subcutaneous; by infusion, liposome-mediated delivery, 
aerosol delivery; topical, oral, nasal, anal, ocular or 
otic delivery. In an embodiment of the above-described 
methods, the oligonucleotide may be conjugated to a 
therapeutic agent. In further preferred embodiments, the 
therapeutic agent is a radioisotope, toxin, toxoid, or 
chemotherapeutic agent. 

The invention also provides a pharmaceutical composition 
comprising either an effective amount of the 
oligonucleotides or of the antibodies described above and 
a pharmaceutically acceptable carrier. In the subject 
invention an "effective amount" is any amount of an 
oligonucleotide or an antibody which, when administered to 
a subject suffering from a disease or abnormality against 
which the oligonucleotide or antibody is effective, 
causes reduction, remission, or regression of the disease 
or abnormality. In the practice of this invention the 
"pharmaceutically acceptable carrier" is any physiological 
carrier known to those of ordinary skill in the art useful 
in formulating pharmaceutical compositions. 

Pharmaceutically acceptable carriers are well known to 
those skilled in the art and include, but are not limited 
to, 0.01-0.1M and preferably 0.05M phosphate buffer or 
0.8% saline. Additionally, such pharmaceutically 
acceptable carriers may be aqueous or non-aqueous 
solutions, suspensions, and emulsions. Examples of 
non-aqueous solvents are propylene glycol, polyethylene 
glycol, vegetable oils such as olive oil, and injectable 
organic esters such as ethyl oleate. Aqueous carriers 
include water, alcoholic/aqueous solutions, emulsions or 
suspensions, including saline and buffered media. 
Parenteral vehicles include sodium chloride solution, 
Ringer's dextrose, dextrose and sodium chloride, lactated 
Ringer's or fixed oils. Intravenous vehicles include 
fluid and nutrient replenishers, electrolyte replenishers 
such as those based on Ringer's dextrose, and the like. 
Preservatives and other additives may also be present, 
such as, for example, antimicrobials, antioxidants, 
chelating agents, inert gases and the like. 
In one preferred embodiment the pharmaceutical carrier may 
be a liquid and the pharmaceutical composition would be in 
the form of a solution. In another equally preferred 
embodiment, the pharmaceutically acceptable carrier is a 
solid and the composition is in the form of a powder or 
tablet. In a further embodiment, the pharmaceutical 
carrier is a gel and the composition is in the form of a 
suppository or cream. In a further embodiment the 
compound may be formulated as a part of a pharmaceutically 
acceptable transdermal patch. 

A solid carrier can include one or more substances which 
may also act as flavoring agents, lubricants, 
solubilizers, suspending agents, fillers, glidants, 
compression aids, binders or tablet-disintegrating agents; 
it can also be an encapsulating material. In powders, the 
carrier is a finely divided solid which is in admixture 
with the finely divided active ingredient. In tablets, 
the active ingredient is mixed with a carrier having the 
necessary compression properties in suitable proportions 
and compacted in the shape and size desired. The powders 
and tablets preferably contain up to 9.9% of the active 
ingredient. Suitable solid carriers include, for example, 
calcium phosphate, magnesium stearate, talc, sugars, 
lactose, dextrin, starch, gelatin, cellulose, 
polyvinylpyrrolidine, low melting waxes and ion exchange 
resins. 

Liquid carriers are used in preparing solutions, 
suspensions, emulsions, syrups, elixirs and pressurized 
compositions. The active ingredient can be dissolved or 
suspended in a pharmaceutically acceptable liquid carrier 
such as water, an organic solvent, a mixture of both or 
pharmaceutically acceptable oils or fats. The liquid 
carrier can contain other suitable pharmaceutical 
additives such as solubilizers, emulsifiers, buffers, 
preservatives, sweeteners, flavoring agents, suspending 
agents, thickening agents, colors, viscosity regulators, 
stabilizers or osmo-regulators. Suitable examples of 
liquid carriers for oral and parenteral administration 
include water (partially containing additives as above, 
e.g. cellulose derivatives, preferably sodium 
carboxymethyl cellulose solution), alcohols (including 
monohydric alcohols and polyhydric alcohols, e.g., glycols) 
and their derivatives, and oils (e.g. fractionated coconut 
oil and arachis oil). For parenteral administration, the 
carrier can also be an oily ester such as ethyl oleate and 
isopropyl myristate. Sterile liquid carriers are useful 
in sterile liquid form compositions for parenteral 
administration. The liquid carrier for pressurized 
compositions can be halogenated hydrocarbon or other 
pharmaceutically acceptable propellant. 

Liquid pharmaceutical compositions which are sterile 
solutions or suspensions can be utilized by, for example, 
intramuscular, intrathecal, epidural, intraperitoneal or 
subcutaneous injection. Sterile solutions can also be 
administered intravenously. The compounds may be prepared 
as a sterile solid composition which may be dissolved or 
suspended at the time of administration using sterile 
water, saline, or other appropriate sterile injectable 
medium. Carriers are intended to include necessary and 
inert binders, suspending agents, lubricants, flavorants, 
sweeteners, preservatives, dyes, and coatings. 
The pharmaceutical composition comprising the 
oligonucleotide or the antibody can be administered orally 
in the form of a sterile solution or suspension containing 
other solutes or suspending agents, for example, enough 
saline or glucose to make the solution isotonic, bile 
salts, acacia, gelatin, sorbitan monooleate, polysorbate 80 
(oleate esters of sorbitol and its anhydrides 
copolymerized with ethylene oxide) and the like. 

The pharmaceutical composition comprising the 
oligonucleotide or the antibody can also be administered 
orally either in liquid or solid composition form. 
Compositions suitable for oral administration include 
solid forms, such as pills, capsules, granules, tablets, 
and powders, and liquid forms, such as solutions, syrups, 
elixirs, and suspensions. Forms useful for parenteral 
administration include sterile solutions, emulsions, and 
suspensions. 

Optimal dosages to be administered may be determined by 
those skilled in the art, and will vary with the 
particular inhibitor in use, the strength of the 
preparation, the mode of administration, and the 
advancement of the disease condition or abnormality. 
Additional factors depending on the particular subject 
being treated will result in a need to adjust dosages, 
including subject age, weight, gender, diet, and time of 
administration. 

This invention will be better understood from the 
Experimental Details which follow. However, one skilled 
in the art will readily appreciate that the specific 
methods and results discussed are merely illustrative of 
the invention as described more fully in the claims which 
follow thereafter. 

EXPERIMENTAL DETAILS 

First Series of Experiments 

Molecular analysis of chromosomal translocations 
associated with multiple myeloma (MM) has indicated that 
the pathogenesis of this malignancy may be heterogeneous, 
being associated with several distinct oncogenes including 
BCL-1, MUM-1 and FGFR3. Structural abnormalities of 
chromosome 1q21, including translocations with chromosome 
14q32, represent frequent cytogenetic aberrations 
associated with multiple myeloma. In order to identify 
the genes involved in these translocations, the breakpoint 
regions corresponding to both derivatives of a 
t(1;14)(q21;q32) detectable in the FR4 human plasmacytoma 
cell line were cloned. Analysis of the breakpoint 
sequences showed that they involved a reciprocal 
recombination between the Immunoglobulin heavy chain (IgH) 
locus on 14q32 and unknown sequences on 1q21. The normal 
locus corresponding to the 1q21 region involved in the 
translocation was cloned and the genes adjacent to the 
breakpoint region were identified by an exon-trapping 
strategy. Two genes were found, located within a 20 Kb 
distance from each other, in the region spanning the 
breakpoint on 1q21. The first gene, called MUM-2 
(multiple myeloma-2), is expressed as a 2.5 Kb mRNA 
transcript detectable in spleen and lymph nodes. Cloning 
and sequencing of the full-length MUM-2 cDNA predicts a 515 
amino acid cell surface glycoprotein containing four 
extracellular Ig-type domains, a transmembrane and a 
cytoplasmic domain and sharing a 37% identity 
(51% homology) with Fc gamma receptor I over its first 
three extracellular domains. In FR4 cells, the 
translocation breakpoints interrupt the MUM-2 coding 
domain and juxtapose it to the IgH locus in the same 
transcriptional orientation. As a consequence, 
structurally abnormal FR4-specific MUM-2 transcripts (3.0, 
5.2 and 6.0 Kb) in lymph nodes and spleen and encodes a 
protein with an extracellular domain containing six Ig-type 
domains homologous to members of the Fc gamma and Ig-type 
adhesion receptor families. The structure of the 
MUM-2 and MUM-3 genes and their direct involvement in a 
MM-associated translocation suggest that these genes code 
for novel cell surface receptors, important for normal 
lymphocyte function and B cell malignancy. 

Second Series of Experiments 
Experimental Procedures 

Cell Lines 

The MM cell lines used in this study (FR4, U266, JJN3, 
EJM, SKMM1, RPMI-8226, XG1, XG2, XG4, XG6, XG7) have been 
previously reported (Tagawa et al., 1990), (Jernberg et 
al., 1987), (Hamilton et al., 1990; Jackson et al., 1989), 
(Eton et al., 1989), (Zhang et al., 1994). The FR4 cell 
line was established in the laboratory of one of the 
authors (S.T.). The U266, JJN3, and EJM cell lines were 
gifts from Dr. K. Nilsson (University of Uppsala, Uppsala, 
Sweden) and the SKMM-1 cell line was a gift of A.N. 
Houghton (Memorial Sloan Kettering Cancer Center, New 
York, NY). The five XG cell lines were obtained from Dr. 
Bernard Klein and cultured in the presence of 1 ng/ml 
human recombinant IL-6 as described previously (Zhang et 
al., 1994). The BL cell lines with 1q21 abnormalities 
have been previously described (Polito et al., 1995), 
(Magrath et al., 1980) and were grown in RPMI, 10% FCS. 

Genomic and cDNA library screening and DNA sequence 
analysis 

Two genomic libraries were constructed from FR4 genomic 
DNA either by BamHI complete digestion or by Sau3AI 
partial digestion and subsequent ligation of gel-purified 
fractions into the λDASH-II phage vector (Stratagene). The 
BamHI library was screened with a 4.2 kb XhoI-BamHI probe 
derived from the Cα locus and the Sau3AI library was 
screened with a 5'Sα probe previously described (Bergsagel 
et al., 1996). A human placental DNA library (Stratagene) 
was screened with probe 1.0EH (Figures 8A-8C) to obtain 
the germline 1q21 locus. Library screening and plaque 
isolation were performed according to established 
procedures (Sambrook et al., 1989). IRTA1 and IRTA2 cDNA 
clones were isolated from an oligo-dT/random-primed cDNA 
library constructed from normal human spleen RNA 
(Clontech). The IRTA1 cDNA probe used for library 
screening was obtained from RT-PCR of human spleen cDNA 
using primers flanking exons 1 and 3. DNA sequencing was 
performed on an ABI 373 automated sequencer (Applied 
Biosystems). Sequence homology searches were carried out 
through the BLAST e-mail server at the National Center for 
Biotechnology Information, Bethesda, MD. 

PAC and YAC isolation and exon trapping 

Human PAC clones were obtained by screening a human PAC 
library spotted onto nylon membranes (Research Genetics), 
with the 1.0EH probe (Figures 8A-8C). The Zeneca 
(formerly ICI) human YAC library (Anand et al., 1990) 
obtained from the United Kingdom Human Genome Mapping 
Resource Center (UK-HGMP) was screened using a PCR-based 
pooling strategy. Exon trapping was performed using the 
exon trapping system (Gibco BRL), according to the 
manufacturer's instructions. 

Isolation of PAC/YAC end clones, pulsed-field gel 
electrophoresis (PFGE) and fluorescence in situ 
hybridization (FISH) analysis 

PAC DNA extraction was performed according to standard 
alkaline lysis methods (Drakopoli N et al., 1996). A 
vectorette-PCR method was used to isolate PAC and YAC end 
probes (Riley et al., 1990), as previously described (Iida 
et al., 1996). PFGE analysis was performed according to 
standard protocols (Drakopoli N et al., 1996) using the 
CHEF Mapper system (BioRad, Hercules, CA). Biotin 
labeling of PAC DNA, chromosome preparation and FISH were 
performed as previously described (Rao et al., 1993). 



Southern and Northern blot analyses, RACE and RT-PCR 

Southern and Northern blot analyses were performed as 
described previously (Neri et al., 1991). For Northern 
blot analyses total RNA was prepared by the guanidinium 
thiocyanate method and poly(A) RNA was selected using 
poly(T)-coated beads (Oligotex Kit by Qiagen). For 
Northern blots, 2 mg of poly(A) RNA were loaded per lane. 
Multiple tissue Northern filters were obtained from 
Clontech. RACE was performed using the Marathon cDNA 
Amplification kit (Clontech) and Marathon-Ready spleen 
cDNA. First strand cDNA synthesis was performed using the 
Superscript RT-PCR system (Gibco BRL). 

In situ hybridization 

Digoxigenin-containing antisense and sense cRNA probes 
were transcribed with T3 and T7 RNA polymerase, 
respectively, from linearized pBluescript KS+ plasmids 
containing coding region of cDNAs (nucleotides 62 to 1681 
of IRTA1 and 18 to 2996 of IRTA2). Hyperplastic human 
tonsillar tissue surgically resected from children in 
Babies' Hospital, Columbia Presbyterian Medical Center was 
snap frozen in powdered dry ice. Cryostat sections were 
stored for several days at -80 degrees C prior to 
processing. Non-radioactive in situ hybridization was 
performed essentially as described (Frank et al., 1999), 
except that fixation time in 4% paraformaldehyde was 
increased to 20 minutes, and proteinase K treatment was 
omitted. The stringency of hybridization was 68 degrees 
C, in 5X SSC, 50% formamide. Alkaline phosphatase-conjugated 
anti-digoxigenin antibody staining was 
developed with BCIP/NBT substrate. 

Transfection, immunoprecipitation and Western Blotting 

293 cells (ATCC), grown in DMEM, 10% FCS, were transiently 
transfected, according to the standard calcium phosphate 
method, with pMT2T and pMT2T-IRTA1/Cα transient expression 
constructs. The latter was generated using the 
IRTA1/Cα RT-PCR product from FR4. Cells (2x10^6 of 
transfectants and 2x10^7 of remaining cell lines) were 
solubilized in Triton X-100 lysis buffer (150 mM NaCl, 10 
mM Tris-HCl [pH 7.4], 1% Tx-100, 0.1% BSA) in the presence 
of a protease inhibitor cocktail (Roche Biochemicals). 
Lysates were incubated at 4°C for 2 hours with 4 mg/ml of 
the monoclonal antibody #117-332-1 (Yu et al., 1990) 
(Tanox Biosystems, Inc, Houston, Texas) that was raised 
against the extracellular portion of the IgA membrane 
peptide. Immune complexes were isolated with protein G-Sepharose 
(Pharmacia) prior to electrophoresis on 10-20% 
Tris-HCl gradient gels (Biorad) and immunoblotting, using 
15 mg/ml of the #117-332-1 antibody. Results were 
visualized by ECL (Amersham). 
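
The lysis buffer composition given above (150 mM NaCl, 10 mM Tris-HCl pH 7.4, 1% Triton X-100, 0.1% BSA) can be prepared from concentrated stocks using the dilution relation C1*V1 = C2*V2. The following is a minimal sketch of that arithmetic only. The stock concentrations (5 M NaCl, 1 M Tris-HCl, 10% Triton X-100, 10% BSA) and the 10 mL batch size are assumptions chosen for illustration and are not stated in this specification.

```python
def stock_volume_ml(final_conc, stock_conc, final_volume_ml):
    """Volume of stock needed, from C1*V1 = C2*V2 (both concentrations in the same unit)."""
    if stock_conc <= 0 or final_conc > stock_conc:
        raise ValueError("stock must be more concentrated than the final solution")
    return final_conc / stock_conc * final_volume_ml


# (final concentration, stock concentration). Final values are from the text;
# stock values are ASSUMED for illustration only.
RECIPE = {
    "NaCl (mM)": (150.0, 5000.0),
    "Tris-HCl pH 7.4 (mM)": (10.0, 1000.0),
    "Triton X-100 (%)": (1.0, 10.0),
    "BSA (%)": (0.1, 10.0),
}


def lysis_buffer_volumes(final_volume_ml):
    """Return the mL of each stock, plus water to reach the final volume."""
    volumes = {
        name: stock_volume_ml(final, stock, final_volume_ml)
        for name, (final, stock) in RECIPE.items()
    }
    volumes["water"] = final_volume_ml - sum(volumes.values())
    return volumes


if __name__ == "__main__":
    for name, vol in lysis_buffer_volumes(10.0).items():
        print(f"{name}: {vol:.2f} mL")
```

The protease inhibitor cocktail is omitted from the sketch because its working concentration is not given in the text.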

RESULTS 

Molecular Cloning of the t(1;14)(q21;q32) 

Chromosomal translocations involving the Ig heavy-chain 
(IGH) locus often occur within or near IgH switch regions 
as a result of "illegitimate" switch recombination events 
(Dalla-Favera et al., 1983; Chesi et al., 1996; Chesi et 
al., 1998). The breakpoints can be detected by Southern-blot 
hybridization assays as rearranged alleles in which 
the IGH constant (CH) region sequences have lost their 
syntenic association with IGH joining (JH) and 5' switch 
region (S) sequences (Dalla-Favera et al., 1983; Neri et 
al., 1988; Neri et al., 1991; Bergsagel et al., 1996). 
This assay has led to the identification of several 
chromosomal partners for the IgH locus in B-NHL and MM 
(Taub et al., 1982; Dalla-Favera et al., 1983; Neri et 
al., 1988; Neri et al., 1991; Ye et al., 1993; Chesi et 
al., 1996; Richelda et al., 1997; Iida et al., 1997; 
Dyomin et al., 1997; Dyomin et al., 2000). We employed 
the same strategy in order to clone the 1q21 breakpoint 
region in FR4, a myeloma cell line carrying a 
t(1;14)(q21;q32), as determined by cytogenetic analysis 
(Tagawa et al., 1990; Taniwaki M, unpublished results). 
Two "illegitimately" rearranged fragments were identified 
within the Cα heavy-chain locus in FR4 by Southern blot 
hybridization analysis (data not shown), and were cloned 
from phage libraries constructed from FR4 genomic DNA. 
Restriction mapping, Southern blot hybridization and 
partial nucleotide sequencing of two genomic phages 
(clones λFR4B-5 and λFR4S-a, Figure 8A) demonstrated that 
they contained the chromosomal breakpoints of a reciprocal 
balanced translocation between the Cα1 locus on 14q32 and 
non-IGH sequences. A probe (1.0EH) representing these 
non-IgH sequences (Figure 8A) was then used to clone the 
corresponding normal genomic locus from phage, P1 
artificial chromosome (PAC), and yeast artificial 
chromosome (YAC) human genomic libraries. Fluorescence in 
situ hybridization (FISH) analysis of normal human 
metaphase spreads using the 100-kb non-chimaeric PAC clone 
49A16, which spans the breakpoint region (see below, Figure 
13), identified the partner chromosomal locus as derived 
from band 1q21 (Figure 8C). Mapping to a single locus 
within chromosome 1 was confirmed by hybridization of two 
non-repetitive probes to DNA from a somatic-cell hybrid 
panel representative of individual human chromosomes (data 
not shown). These results were consistent with the 
cloning of sequences spanning the t(1;14)(q21;q32) in FR4. 

Sequence analysis of the breakpoint regions on the 
derivative chromosomes and alignment with the germline 
14q32 and 1q21 loci revealed that the breakpoint had 
occurred in the intron between the CH3 and the 
transmembrane exon of Cα1 on chromosome 14. Although the 
breakpoint region was devoid of recombination signal 
sequences (RSS) or switch signal sequences (Kuppers et 
al., 1999), the sequence CTTAAC (underlined on Figure 8B) 
was present in both germline chromosomes 14 and 1 at the 
breakpoint junction. One copy of this sequence was 
present in each of the derivative chromosomes, with a 
slight modification in the der(1) copy (point mutation in 
the last nucleotide: C to G). The nucleotides AT 
preceding CTTAAC on chromosome 1 were also present in both 
derivative chromosomes (Figure 8B). The translocation did 
not result in any loss of chromosome 1 sequences. On the 
other hand, in the chromosome 14 portion of der(1) we 
observed two deletions upstream of the breakpoint 
junction: a 16 nucleotide deletion (GGCACCTCCCCTTAAC) and 
a 4 nucleotide deletion (TGCA) 6 nucleotides upstream 
(Figure 8B). These observations indicate that the 
t(1;14)(q21;q32) in FR4 cells represents a balanced 
reciprocal translocation possibly facilitated by the 
presence of homologous sequences (CTTAAC) on both 
chromosomes. 
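
The relationships among the short sequences reported above can be checked mechanically. The sketch below is illustrative only: it uses the three sequences quoted in the text (CTTAAC, GGCACCTCCCCTTAAC and TGCA), while the helper names `find_motif` and `mismatches` are hypothetical and the full breakpoint sequences of Figure 8B are not reproduced. It shows that the 16 nucleotide deletion ends with the shared CTTAAC hexamer, and that a C to G change in the last nucleotide yields CTTAAG.

```python
MOTIF = "CTTAAC"                  # sequence shared by germline chromosomes 1 and 14
DELETED_16 = "GGCACCTCCCCTTAAC"   # 16 nucleotide deletion reported in der(1)
DELETED_4 = "TGCA"                # 4 nucleotide deletion reported in der(1)


def find_motif(seq, motif):
    """Return the 0-based start of every (possibly overlapping) occurrence of motif."""
    positions = []
    start = seq.find(motif)
    while start != -1:
        positions.append(start)
        start = seq.find(motif, start + 1)
    return positions


def mismatches(a, b):
    """List (position, base_in_a, base_in_b) for two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return [(i, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]


# der(1) copy of the motif: point mutation of the last nucleotide, C to G.
DER1_COPY = MOTIF[:-1] + "G"

if __name__ == "__main__":
    print(len(DELETED_16), find_motif(DELETED_16, MOTIF))
    print(DER1_COPY, mismatches(MOTIF, DER1_COPY))
```

The same `find_motif` helper could be applied to longer germline sequences to locate candidate microhomologies, although short hexamers occur frequently by chance in genomic DNA.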



The 1q21 breakpoint region contains genes coding for novel 
members of the Immunoglobulin Receptor Superfamily 

We next investigated whether the region of chromosome 1q21 
spanning the translocation breakpoint in FR4 contains a 
transcriptional unit. DNA from partially overlapping PAC 
clones 49A16 and 210K22 (Figure 13) was "shotgun" cloned in 
plasmids, sequenced and analyzed for homology to known 
genes in human genome databases. In parallel, candidate 
genes on the 49A16 PAC were sought by an exon trapping 
strategy (Church et al., 1994). 

Mapping of the candidate exons on the 1q21 genomic clones
revealed that the FR4 breakpoint had occurred between two 
trapped exons (see below, Figure 13), which belonged to 
the same transcript since they could be linked by RT-PCR 
using spleen RNA. This RT-PCR product was then used as a 
probe to screen a spleen cDNA library in order to isolate 
full-length clones corresponding to this transcript. Two 
sets of cDNA clones were identified, belonging to two
distinct transcripts and sharing 76% mRNA sequence
identity within the 443 bp probe region. Full length cDNA
clones for both transcripts were obtained by rapid 
amplification of cDNA ends (RACE) on human spleen cDNA 
that generated 5' and 3' extension products. 
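
The 76% value quoted above is a percent identity over an aligned region. As a minimal sketch of how such a figure can be computed from two sequences that have already been aligned, the following may help. The function name and the toy sequences are hypothetical and are not data from this application, and the choice to skip gapped columns is only one of several conventions used by alignment programs.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Columns in which either sequence carries a gap ('-') are skipped.
    This gap convention is an assumption of the sketch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # ignore gapped columns
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Hypothetical 10-nucleotide toy alignment (8 of 10 positions match).
toy_a = "ACGTACGTAC"
toy_b = "ACGTTCGTAA"
print(percent_identity(toy_a, toy_b))
```

Under the same convention, 76% identity over an ungapped 443 bp region corresponds to roughly 337 matching positions.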

The schematic structure of the cDNA representing the first
transcript is depicted in Figure 9A. Alternate usage of
three potential polyadenylation sites in its 3'
untranslated region gives rise to three mRNA species of
2.6, 2.7 and 3.5 kb, encoding the same putative 515-amino
acid protein (Figure 9A). The predicted features of this
protein include a signal peptide, in accordance with the
[-3, -1] rule (von Heijne, 1986), four extracellular
Ig-type domains carrying three potential asparagine
(N)-linked glycosylation sites (Figure 9A), a 16 amino acid
transmembrane domain and a 106 amino acid cytoplasmic
domain with three putative consensus Src-homology 2
(SH2)-binding domains (Unkeless and Jin, 1997) (Figure
10B). These SH2-binding domains exhibit features of both
ITAM (Immune-receptor Tyrosine-based Activation Motif:
D/E X7 D/E X2 YXXL/I X6-8 YXXL/I, where X denotes
non-conserved residues) (Reth, 1989) and ITIM
(Immune-receptor Tyrosine-based Inhibition Motif:
S/V/L/I-YXXL/V, where X denotes non-conserved residues)
(Unkeless and Jin, 1997). As shown in Figure 10B, the
first two SH2-binding domains are spaced 8 amino acids
apart, consistent with the consensus ITAM motif.
Diverging from the consensus, the glutamate residue (E) is
positioned four rather than two amino acids before the
first tyrosine (Y) (Figure 10B), and the +3 position
relative to tyrosine (Y) is occupied by valine (V) rather
than leucine (L) or isoleucine (I) (Cambier, 1995). All
three domains conform to the ITIM consensus and each is
encoded by a separate exon, as is the case for ITIM. Thus
their arrangement may give rise to three ITIM or possibly
to one ITAM and one ITIM. The overall structure of this
protein suggests that it represents a novel transmembrane
receptor of the Ig superfamily and it was therefore named
IRTA1 (Immune Receptor Translocation Associated gene 1).
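
The ITAM and ITIM consensus patterns quoted above can be written as regular expressions and used to scan a protein sequence. The following is a minimal sketch, assuming one-letter amino acid codes. The helper name and the toy peptides are hypothetical and were constructed only to satisfy each consensus; they are not IRTA sequence. A strict consensus search of this kind would not report near-matches such as the IRTA1 site, which departs from the ITAM consensus as described above.

```python
import re

# ITIM consensus as quoted in the text: S/V/L/I - Y - X - X - L/V
ITIM = re.compile(r"[SVLI]Y..[LV]")

# ITAM consensus as quoted in the text:
# D/E - X7 - D/E - X2 - Y X X L/I - X6-8 - Y X X L/I
ITAM = re.compile(r"[DE].{7}[DE].{2}Y..[LI].{6,8}Y..[LI]")


def find_motifs(pattern, protein):
    """Return (start, matched_text) for every match, overlapping ones included."""
    # A capturing lookahead lets overlapping occurrences be reported.
    lookahead = re.compile("(?=(" + pattern.pattern + "))")
    return [(m.start(), m.group(1)) for m in lookahead.finditer(protein.upper())]


# Hypothetical peptides built to match each consensus (not IRTA sequence).
toy_itim = "AAAVYSELAAA"
toy_itam = "E" + "A" * 7 + "D" + "AA" + "YAAL" + "A" * 7 + "YAAI"

print(find_motifs(ITIM, toy_itim))
print(find_motifs(ITAM, toy_itam))
```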

The second cDNA shares homology to IRTA1 (68% nucleotide
identity for the length of the IRTA1 message encoding its
extracellular domain) and was named IRTA2. The IRTA2
locus is more complex than IRTA1 and is transcribed into
three major mRNA isoforms (IRTA2a, IRTA2b, IRTA2c) of
different molecular weight (2.8, 4.7 and 5.4 kb,
respectively), each with its own unique 3' untranslated
region (Figure 9B). In addition, a 0.6 kb transcript
(Figure 12A) arises from the usage of an early
polyadenylation signal at nucleotide 536 of IRTA2. The
three predicted IRTA2 protein isoforms encoded by these
transcripts share a common amino acid sequence until
residue 560, featuring a common signal peptide and six
extracellular Ig-type domains (Figure 9B). IRTA2a encodes
a 759 aa secreted glycoprotein with eight Ig-type
domains followed by 13 unique, predominantly polar
amino acids at its C-terminus. IRTA2b diverges from IRTA2a
at amino acid residue 560, and extends for a short stretch
of 32 additional residues, whose hydrophobicity is
compatible with its docking to the plasma membrane via a
GPI-anchor (Ferguson and Williams, 1988). IRTA2c is the
longest isoform, whose sequence deviates from IRTA2a at
amino acid 746. It encodes a 977 aa type I transmembrane
glycoprotein with nine extracellular Ig-type domains,
harboring eight potential N-linked glycosylation sites, a
23 amino acid transmembrane domain and a 104 amino acid
cytoplasmic domain with three consensus SH2-binding motifs
(Figure 10B). Each of the SH2-binding sites in IRTA2c
agrees with the ITIM consensus (Figure 10B) and is encoded
by a separate exon. These features suggest that IRTA2c is
a novel transmembrane receptor of the Ig superfamily with
secreted and GPI-linked isoforms.

Homology between the IRTA proteins and Immunoglobulin 
Superfamily Receptors 

Amino acid alignment of the entire extracellular domains 
of the IRTA1 and IRTA2 proteins to each other and to other 
Ig superfamily members revealed a remarkable homology 
between them (47% identity and 51% similarity) and a 
lower, but striking homology to the Fc gamma receptor 
family of proteins. This homology was stronger in the 
amino acid positions conserved among the different classes



WO 01/38490 PCT/US00/32403






of Fc receptors. Among Fc receptors, the high affinity
IgG receptor FCGRI (CD64) shared the highest levels of
homology with the first three Ig-domains of IRTA1 and
IRTA2 (37% identity and 50% similarity) throughout its
entire extracellular portion (Figure 10A). Lower levels
of homology were observed between the IRTA proteins and
the extracellular domains of other cell surface molecules,
including human platelet endothelial cell adhesion
molecule (PECAM1), B-lymphocyte cell adhesion molecule
(CD22) and Biliary Glycoprotein 1 (BGP1) (22-25% identity,
38-41% homology).

No homology is apparent between the IRTAs and members of
the Fc receptor family in their cytoplasmic domains. In
contrast, significant amino acid homology is present
between IRTA1 and PECAM1 (31% amino acid identity and 45%
homology), IRTA2c and BGP1 (30% identity, 35% homology)
and IRTA2c and PECAM1 (28% identity, 50% homology) (Figure
10B). These homologies suggest employment of similar
downstream signaling pathways by these different proteins.



IRTA1 and IRTA2 are normally expressed in specific 
subpopulations of B cells 

The normal expression pattern of the IRTA1 and IRTA2 mRNAs 
was first analyzed by Northern blot hybridization of RNA 
derived from different normal human tissues and from human 
cell lines representing different hematopoietic lineages 
and stages of B-cell development. 
IRTA1 expression was detected at a very low level in human
spleen and lymph node RNA (Figure 11A, left panel) and was
undetectable in all other human tissues analyzed,
including fetal liver, bone marrow, lung, placenta, small
intestine, kidney, liver, colon, skeletal muscle, heart
and brain (data not shown). Among B cell lines, IRTA1
expression was absent in cell lines representing pre-B and
germinal center B-cells, plasma cells and cells of
erythroid, T-cell and myeloid origin (data not shown, see
Materials and Methods). Expression was detectable at very
low levels only in EBV-immortalized lymphoblastoid cell
lines (LCL), which represent a subpopulation
(immunoblasts) positioned downstream of germinal center B
cells in B-cell differentiation. However, expression was
induced in estrogen-deprived ER/EB cells which, being
immortalized by a recombinant EBV genome in which the
EBNA2 gene is fused to the estrogen receptor, proliferate
in the presence of estrogen while they arrest in the G0/G1
phase upon estrogen deprivation (Kempkes et al., 1995).
IRTA1 expression was barely detectable in these cells in
the presence of estrogen, but was induced (10-fold) upon
their G0/G1 arrest following estrogen withdrawal (Figure
11A, right panel). Taken together, these results suggest
that IRTA1 is expressed in a lymphoid subpopulation
present in spleen and lymph nodes and presumably
represented by resting B cells.

To further investigate the phenotype and tissue
distribution of the cells expressing IRTA1, we performed
in situ hybridization on human tonsillar tissue using an
IRTA1 antisense cDNA probe (Figure 11B). Serial sections
were processed for in situ hybridization with a control






sense cDNA probe (Panel #1 in Figure 11B), an antisense
cDNA probe (Panel #2) and hematoxylin and eosin (H&E)
staining (Panel #3) to outline the architecture of the
lymphoid tissue. The IRTA1 hybridization signal was
excluded from the germinal center and the mantle zone of
the follicles and was characteristically concentrated in
the perifollicular zone with infiltrations in the
intraepithelial region (Figures 11B-2, 11B-4). In this
region, only B cells were positive as documented by
staining with B cell specific markers (IgD, not shown),
and by immunohistochemical analysis with anti-IRTA1 and
anti-B (CD20, PAX5), anti-T (CD3), and anti-monocyte (CD68)
antibodies (not shown; G. Cattoretti et al., manuscript in
preparation). This perifollicular area is the "marginal
zone" equivalent of the tonsil, representing a functionally
distinct B-cell compartment that contains mostly memory
B-cells and monocytoid B-cells (de Wolf-Peeters et al.,
1997). Together with the Northern blot analysis of normal
tissues and cell lines, these results indicate that IRTA1
is expressed in a subpopulation of resting mature B-cells
topographically located in the perifollicular and
intraepithelial region, sites rich in memory B cells.

In the case of IRTA2, Northern blot analysis detected all
alternatively spliced species in human lymph node, spleen,
bone marrow and small intestine mRNA, with relative
preponderance of the IRTA2a isoform (Figure 12A, left
panel). Among the hematopoietic cell lines of lymphoid
and non-lymphoid origin tested, IRTA2 expression was
restricted to B-cell lines with an immunoblastic,
post-germinal center phenotype (Figure 12A, right panel).
Similarly to IRTA1, it was absent from cell lines derived
from pre-B cells, germinal center centroblasts, plasma
cells, T-cells, erythroid cells and myeloid cells (Figure
12A, right panel).



In situ hybridization analysis of human tonsillar tissue,
using the IRTA2c cDNA as a probe, was consistent with the
results of the Northern blot analysis. The IRTA2 mRNA was
largely excluded from the mantle zone of the germinal
center, with the exception of a few positive cells
(Figures 12B-2, 12B-4). Within the germinal center, the
dark zone, represented by centroblasts, appeared negative
for IRTA2, while the light zone, rich in centrocytes, was
strongly positive (Figures 12B-2, 12B-4). Finally, IRTA2
mRNA was detected in the "marginal zone" equivalent region
outside germinal center follicles and in the
intraepithelial and interfollicular regions of the tonsil.
This pattern is consistent with specificity of IRTA2 for
centrocytes and post-germinal center B cells. Comparing
the expression patterns of IRTA1 and IRTA2, we conclude
that both are specific for mature B cells, but IRTA2 has a
broader pattern of expression that includes centrocytes
and interfollicular B cells, while IRTA1 is restricted to
marginal zone B cells, most likely memory cells.

Genomic organization of the IRTA1 and IRTA2 genes

To understand the consequences of 1q21 abnormalities on
IRTA1 and IRTA2 gene structure and expression, we first
determined the organization of their genomic loci. The
IRTA1 gene contains 11 exons with a total genomic size of
24.5 kb (Figure 13). The IRTA2 locus was found to span a
genomic region of approximately 40 kb (Figure 13). The
three IRTA2 alternatively spliced products share their
first 8 exons, at which point IRTA2b does not utilize the
next splicing site, and terminates by entering its 3'UTR
region. The IRTA2a and IRTA2c isoforms splice into exon 9,
with IRTA2a entering into its 3'UTR after exon 11 and
IRTA2c splicing into exon 12 and extending until exon 18
(Figure 13).
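
The exon usage just described can be summarized in a small lookup table. The sketch below is illustrative only: it assumes that the numbered exons are used contiguously by each isoform, and it does not model the isoform-specific 3'UTR segments. The dictionary and function names are hypothetical.

```python
# Exon usage per IRTA2 isoform, following the description in the text:
# all three isoforms share exons 1-8; IRTA2b enters its 3'UTR after exon 8,
# IRTA2a continues through exon 11, and IRTA2c continues through exon 18.
ISOFORM_EXONS = {
    "IRTA2a": set(range(1, 12)),  # exons 1-11
    "IRTA2b": set(range(1, 9)),   # exons 1-8
    "IRTA2c": set(range(1, 19)),  # exons 1-18
}


def shared_exons(isoforms):
    """Return the sorted list of exons used by every isoform named."""
    exon_sets = [ISOFORM_EXONS[name] for name in isoforms]
    return sorted(set.intersection(*exon_sets))


print(shared_exons(["IRTA2a", "IRTA2b", "IRTA2c"]))
print(shared_exons(["IRTA2a", "IRTA2c"]))
```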

Based on sequencing data, we determined that the IRTA1 and
IRTA2 genes are located 21 kb distant from each other,
juxtaposed in the same transcriptional orientation (Figure
13) that extends from the telomere (5') towards the
centromere (3'). At the 1q21 locus, they are tightly
linked to each other as well as to three additional genes
we recently cloned through their homology to the IRTAs
(I.M., manuscript in preparation). All five genes are
contiguous, covering a ~300 kb region at 1q21. This
region is located at the interval between previously
reported 1q21 breakpoints. Based on the distance between
genomic clones harboring the respective genes on the
Whitehead Institute Radiation Hybrid map, the IRTA1-2
locus is estimated to lie approximately 0.8 Mb away from
the MUC1 locus towards the telomere (N.P., unpublished
data; Dyomin et al., 2000; Gilles et al., 2000) and less
than or equal to 7 Mb away from the FCGRIIB locus towards
the centromere (N.P., unpublished data).

The t(1;14)(q21;q32) translocation generates an IRTA1/Cα1
fusion protein in the FR4 myeloma cell line

Comparative restriction and nucleotide sequence analysis
of germline versus rearranged sequences from the Cα1 and
IRTA1 loci showed that the translocation had fused
sequences within intron 2 of the IRTA1 gene to the
intronic sequences between the CH3 and the transmembrane
exon of Cα1 in the same transcriptional orientation (Figure
14A). This suggested that, if IRTA1 sequences were
expressed in the translocated locus, the intact donor site
at the 3' border of the IRTA1 exon and the intact acceptor
site at the 5' of Cα1 could be used to generate a fusion
IRTA1/Cα1 mRNA, and possibly an IRTA1/Cα1 fusion protein.

In order to test this prediction, we analyzed IRTA1 mRNA
expression in FR4 by Northern blot analysis using an IRTA1
cDNA probe derived from exon 1 (Figure 14A). This probe
detected a 0.8 kb message in FR4 that was absent from
other B-cell lines, and was shorter than the normal 2.5 kb
message detectable in ER/EB cells (Figure 14B). We cloned
this transcript by RT-PCR of FR4 mRNA using primers
derived from sequences at the 5' border of IRTA1 exon 1
and the 3' border of the Cα1 cytoplasmic exon (Figure 14A).
An RT-PCR product was obtained from FR4, but not from the
DAKIKI cell line expressing wild-type surface IgA, or
other cell lines lacking a t(1;14) translocation (data not
shown). Direct sequencing analysis of the PCR product
indicated that splicing had precisely linked IRTA1 and
Cα1 at canonical splicing sites and determined that the
fusion transcript was 820 bp long.

Analysis of the predicted protein product indicated that
the IRTA1/Cα1 splicing had resulted in a fusion between the
IRTA1 signal peptide and first two extracellular
amino acids, with the 32-amino acid long extracellular
spacer, transmembrane domain and cytoplasmic tail of the
membrane IgA1 (mIgA1) receptor (Figure 14C). To assay for
the expression of this fusion protein in FR4 protein
extracts, we used an antibody directed against
extracellular amino acid residues specific for the
transmembrane isoform of Cα1 (Yu et al., 1990) for
immunoprecipitation, followed by Western blotting. Our
results demonstrated that FR4 cells, but not a control
cell line (DAKIKI) expressing wild-type surface IgA,
express a 9.8 kDa protein consistent with the predicted
size of the IRTA1/Cα1 fusion protein (Figure 14D). These
results show that the translocated allele encodes a fusion
protein, composed of the signal peptide and first two
extracellular residues of IRTA1 (17 amino acids) fused to
the Cα1-encoded transmembrane and cytoplasmic domains (71
amino acids). In contrast to IRTA1/Cα1 overexpression on
der(14), no expression was detected in FR4 for the
reciprocal Cα1/IRTA1 transcript or for the intact IRTA2
gene on der(1).
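
As a rough arithmetic check on the size quoted above, the residue counts given in the text (17 from IRTA1 and 71 from Cα1) can be converted into an approximate molecular mass. The sketch assumes a rule-of-thumb average of about 110 Da per amino acid residue. This average is an assumption of the example and not a value from this application, and the true mass depends on amino acid composition and on any processing or modification of the protein.

```python
AVERAGE_RESIDUE_MASS_DA = 110.0  # rule-of-thumb average; an assumption


def estimated_mass_kda(n_residues, avg_residue_mass_da=AVERAGE_RESIDUE_MASS_DA):
    """Approximate protein mass in kDa from a residue count."""
    return n_residues * avg_residue_mass_da / 1000.0


# Residue counts quoted in the text for the two parts of the fusion protein.
irta1_residues = 17
calpha1_residues = 71
fusion_length = irta1_residues + calpha1_residues

print(fusion_length, "residues")
print(round(estimated_mass_kda(fusion_length), 2), "kDa (approximate)")
```

An 88-residue polypeptide estimated this way comes out at a little under 10 kDa, which is of the same order as the 9.8 kDa band reported.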

With the exception of FR4, IRTA1 mRNA expression was not
detected in any other myeloma or lymphoma cell line,
regardless of the status of its chromosomal band 1q21
(data not shown). Thus, the IRTA1/Cα1 fusion represents a
rare event in 1q21 aberrations.

Frequent deregulation of IRTA2 expression in cell lines
carrying 1q21 abnormalities

In order to establish the physical relationship between
other 1q21 breakpoints and the IRTA1/2 locus, we performed
FISH analysis with the PAC 49A16 on our panel of BL and MM
cell lines. Among ten BL cell lines analyzed, seven with
dup(1)(q21q32) and three with 1q21 translocations (AS283A,
BL104, BL136), we detected three signals corresponding to
the IRTA1/IRTA2 locus in seven of the former and two of
the latter, consistent with dup(1)(q21q32) in the first
case and dup(1)(q21q32) followed by a translocation
breakpoint at 1q21 in the second (Table 1). FISH
analysis of AS283A and BL136, using probes spanning the
IRTA locus and with neighboring genomic clones, placed the
breakpoint of the derivative chromosomes outside the IRTA
locus in both cell lines, at a distance of >800 kb towards
the centromere in AS283A and >800 kb towards the telomere
in BL136 (N.P., unpublished results). Consistent with this
finding, analysis of 30 cases of MM primary tumors by
interphase FISH with the 300-kb YAC 23GC4 (Figure 13)
showed that 15 cases (50% of total analyzed) had more than
two interphase FISH signals (data not shown), while double
color FISH with two PAC clones flanking the YAC
centromeric and telomeric borders detected no split of
these two probes in any of the cases. These results
indicate that, with the exception of FR4, the breakpoints
of 1q21 aberrations in BL or MM are not within or in close
proximity to the genomic region defined by IRTA1 and
IRTA2. However, the consistent outcome of either
dup(1)(q21q32) (see Table 1) or dup(1)(q21q32) followed by
unbalanced translocations (AS283A, BL136, XG2, XG7 in
Table 1) is partial trisomy or tetrasomy of the region of
1q21 containing the IRTA genes.
We then investigated whether these aberrations had an
effect on IRTA2 mRNA expression. To this end, we used a
cDNA probe corresponding to the IRTA2 5' untranslated
region to screen a Northern blot with a panel of B-NHL and
MM cell lines lacking or displaying 1q21 chromosomal
abnormalities. The results show that most (ten out of
twelve) BL lines with normal 1q21 chromosomes essentially
lack IRTA2 expression, consistent with the fact that BL
derive from GC centroblasts which normally lack IRTA2
expression (Figure 15A, left panel). In contrast, most BL
lines carrying 1q21 abnormalities (ten out of twelve)
clearly display IRTA2 mRNA upregulation (Figure 15A, right
panel), ranging from 2- to 50-fold over baseline levels
detected in BL with normal 1q21. Among myeloma cell
lines, IRTA2 was overexpressed in one out of three lines
displaying 1q21 abnormalities (XG2), while it was
expressed in none out of seven with normal 1q21 (Figure
15B).

These results show a strong correlation between the
presence of 1q21 chromosomal aberrations and deregulation
of IRTA2 mRNA expression in BL and suggest that trisomies
of the IRTA2 locus may deregulate its expression in this
lymphoma subtype (see Discussion).
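
The BL cell line counts reported above can be arranged as a 2x2 table (1q21 status against IRTA2 expression), and the strength of the association can be illustrated with a Fisher exact test. This test is not part of the analysis described in this application; it is offered only as a sketch. It assumes that "ten out of twelve essentially lack IRTA2 expression" means two of the twelve lines with normal 1q21 express it. The function name is hypothetical and the implementation uses only the Python standard library (Python 3.8 or later for `math.comb`).

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    total = row1 + row2
    denominator = comb(total, col1)

    def table_probability(x):
        # Hypergeometric probability of seeing x in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / denominator

    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    observed = table_probability(a)
    # Sum every table that is no more probable than the observed one.
    return sum(
        p
        for p in (table_probability(x) for x in range(lowest, highest + 1))
        if p <= observed * (1 + 1e-9)
    )


# Rows: BL lines with 1q21 abnormalities, BL lines with normal 1q21.
# Columns: IRTA2 upregulated or expressed, IRTA2 essentially absent.
p_value = fisher_exact_two_sided(10, 2, 2, 10)
print(p_value)
```

With these counts the two-sided p-value is about 0.003, in line with the strong correlation stated above.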

Discussion

Efforts described herein to identify genes involved in
chromosomal aberrations affecting band 1q21 in Multiple
Myeloma and B cell lymphoma led to the discovery of IRTA1
and IRTA2, two founding members of a novel subfamily of
related receptors within the immunoreceptor family; full
length nucleic acid sequences encoding IRTA1 and IRTA2
proteins are provided herein, as are the amino acid
sequences of the encoded IRTA1 and IRTA2 proteins.
Subsequently three additional genes of members of this
subfamily of related receptors were isolated, IRTA3,
IRTA4, and IRTA5, the full length nucleic acid sequences
of which are provided herein, as are the amino acid
sequences of the encoded IRTA3, IRTA4, and IRTA5 proteins.
These results have implications for the normal biology of
B cells as well as for the role of 1q21 aberrations in
lymphomagenesis.

IRTA1 and IRTA2 are founding members of a new subfamily
within the Ig superfamily

Several features shared between the two IRTA genes and
their encoded proteins suggest that they form a new
subfamily within the immunoreceptor superfamily. First,
they share a higher degree of homology with each other in
their extracellular domains than with other superfamily
members both in their mRNA (68% identity) and protein (47%
identity) sequence. Second, they share homology in their
cytoplasmic domains, marked by the presence of ITAM-like
and ITIM signaling motifs in the context of homologous
amino acid sequences. Third, IRTA1 and IRTA2 belong to a
larger subfamily of five genes displaying higher
intrafamily homology and tight clustering within a ~300 kb
region at 1q21 (I.M. et al., manuscript in preparation).
Their genomic organization suggests that a common
ancestral gene may have given rise to this subfamily, by a
process of duplication and sequence divergence, similar to
the mechanism proposed for the Fc receptor family (Qiu et
al., 1990).

In their extracellular domain, the IRTA proteins are
closely related to the Fc receptor subfamily based on the
high degree of amino acid homology shared especially with
the high affinity FCGRI receptor (37-45% amino acid
identity). A common evolutionary origin with Fc receptors
is also suggested by the position of the IRTA family locus
in the interval between the FCGRI locus on 1q21 and the
FCERI and FCGRII-III loci on 1q21-q23. Finally, the IRTA
and FCR genes share a similar exon/intron organization of
the gene portion that encodes their signal peptide, in
particular the two 5' leader exons with the sequences
encoding the signal peptidase site located within the
second 21-bp exon.

Based on their cytoplasmic ITIM-like motifs, the
IRTA proteins can be considered members of the Inhibitory
Receptor Superfamily (IRS), a group of receptors that
block activation of many cell types in the immune system
(Lanier, 1998). Such members include FCGRIIB and CD22 in
the human (DeLisser et al., 1994) and PIR-B in the mouse
(Kubagawa et al., 1997). Analogous to IRS members, the
ITIMs of IRTA1 and IRTA2 are encoded by individual exons.
A feature that many IRS members share is the existence of
corresponding activating receptor isoforms whose
cytoplasmic domains are devoid of ITIMs (reviewed in
Ravetch and Lanier, 1998). It is possible that the
secreted isoform of IRTA2, which lacks ITIM-like motifs,
fulfills an analogous role by counteracting the effect of
the transmembrane isoform.

Significant homology in the sequence and overall
organization of their extracellular portion is shared
among the IRTA1 and IRTA2 proteins and the Cell Adhesion
Molecule (CAM) subfamily members PECAM1, CD22 and BGP1.
In addition, the ability of IRTA2 to generate three
protein isoforms with distinct subcellular localization (a
transmembrane, a GPI-linked or a secreted protein) by
differential splicing is shared by NCAM, another member of
the CAM subfamily (Dickson et al., 1987; Gower et al.,
1988). Thus, the IRTA family is also related to the CAM
family, as has been previously suggested for a member of
the Fc receptor family (murine FCGRII) because of its
homology to PECAM1 (CAM, IRS family) (Daeron, 1991; Newman
et al., 1990; Stockinger et al., 1990).

In conclusion, the IRTA family may represent an 
intersection among the Fc, IRS and CAM families, combining 
features from all three. Accordingly, IRTA proteins may 
have a role in the regulation of signal transduction 
during an immune response (like Fc receptors),
intercellular communication (like members of the IRS and
CAM families) and cell migration (like CAM family members)
(DeLisser et al., 1994; Ravetch and Lanier, 2000).
Initial experiments indicate that IRTA1 can weakly bind
heat aggregated IgA, while IRTA2c can specifically bind
heat aggregated human serum IgG (with higher affinity for
IgG1 and IgG2), but not monomeric human IgG, IgA, IgM and
IgE (data not shown). These initial data lend support to
a functional relationship between the IRTA and the Fc
receptor families, but do not exclude functions dependent
on other ligands for the IRTA proteins.



Differential pattern of expression of IRTA genes in mature
B cells 

The IRTA genes display a specific pattern of expression in
various normal B cell compartments. IRTA1 is
topographically restricted to B cells within the
perifollicular region, which was originally named marginal
zone in the spleen, but is also detectable in most
lymphoid organs (de Wolf-Peeters et al., 1997). The in
situ hybridization data presented here have been confirmed
by immunohistochemical analysis using anti-IRTA1
antibodies which show that the IRTA1 protein is
selectively expressed in marginal zone B cells, and, among
NHL, in marginal zone lymphoma, the tumors deriving from
these cells (G. Cattoretti et al., manuscript in
preparation). On the other hand, IRTA2 has a broader
pattern of expression that includes GC centrocytes, as
well as a broad spectrum of perifollicular cells, which
may include immunoblasts and memory cells. Initial data
suggest that the pattern of expression of IRTA3 is
analogous to IRTA2, while IRTA4 and IRTA5 are selectively
expressed in mantle zone B cells (I. Miller et al.,
manuscript in preparation), the pre-GC compartment of
mature B cells (MacLennan, I. C., 1994). This topographic
restriction of IRTA gene expression in lymphoid organs
suggests that the IRTA molecules may play a role in the
migration or activity of various B cell subpopulations in
specific functional B cell compartments. In addition,
IRTA expression should be useful for the differential
diagnosis of NHL subtypes deriving from various B cell
compartments, particularly IRTA1 in the diagnosis of
marginal zone lymphoma.

IRTA1 locus and 1q21 abnormalities in MM

In the FR4 cell line, the consequence of the t(1;14)
translocation is the formation of an IRTA1/Cα1 fusion gene.
Despite the fact that this gene is driven by the IRTA1
promoter region, which is normally silent in plasma cells,
its expression is high in FR4, presumably due to the
influence of the Cα1 3' LCR, which is retained downstream
of the Cα1 locus. The fusion gene encodes an IRTA1/Cα1
fusion protein which contains only the signal peptide and
first two amino acids of IRTA1 linked to the surface IgA
receptor. The latter has been almost completely deprived
of its extracellular domain, but retains all its
transmembrane and intracellular domains. This structure
indicates that the IRTA1/Cα1 fusion protein, though
probably unable to bind any ligand, may retain the
potential for dimerization and signaling. In particular,
the membrane (m) IgA-derived extracellular portion
contains a cysteine residue, which can be involved in
disulphide bonds between two α-chains or between α-chains
and associated proteins, such as the auxiliary surface
receptor CD19 (Leduc et al., 1997). The fusion protein
also carries the intact, 14 amino acid mIgA cytoplasmic
domain, which is highly conserved in evolution (Reth,
1992) and may play an essential role in the proliferation,
survival and differentiation of mature B-cells, analogous
to the role of mIgG and mIgE (Kaisho et al., 1997). Thus,
the emergence of the IRTA1/Cα1 protein in FR4 may have
provided the cells with a proliferative and survival
advantage during tumor development through ligand
(antigen)-independent activation of the BCR pathway. This
fusion event, however, appears to be rare in B-cell
malignancy, since so far we were able to detect it only in
FR4 cells.



IRTA2 locus and 1q21 abnormalities in MM and BL

Abnormal expression of IRTA2 is a frequent consequence of
1q21 abnormalities. Although this gene is not expressed
normally either in centroblasts, the presumed normal
counterparts of BL (Kuppers et al., 1999), or in BL with
normal 1q21, its levels are upregulated on average by
10-fold in BL cell lines with 1q21 abnormalities. This
deregulation appears to be specific for IRTA2 since all
the other 4 IRTA genes present within 300 kb on 1q21 are
either not expressed in BL (IRTA1), or their pattern of
expression does not correlate with the presence of 1q21
abnormalities (IRTA3, 4, 5, not shown). The mechanism by
which this deregulation occurs is difficult to ascertain
in the absence of structural lesions within or adjacent to
the IRTA2 gene. Since the heterogeneous aberrations that
affect 1q21 all cause an excess copy number of the IRTA
locus, it is possible that this may lead to regulatory
disturbances, as is the case for low level amplification
of BCL2 in FL lacking t(14;18) translocations (Monni et
al., 1997), REL in diffuse large cell lymphoma
(Houldsworth et al., 1996; Rao et al., 1998) and
deregulation of Cyclin D1 in some MM cases with trisomy 11
(Pruneri et al., 2000). On the other hand, 1q21
abnormalities, including translocations and duplications,
change the genomic context of the IRTA locus and may lead
to deregulation of IRTA2 by distant cis-acting enhancer
chromatin organizing elements acting on its promoter as is
the case for MYC in endemic BL (Pelicci et al., 1986) and
MM (Shou et al., 2000) and for CCND1 in mantle cell
lymphoma (Bosch et al., 1994; Swerdlow et al., 1995) and
MM (Pruneri et al., 2000).

The biological consequences of deregulated IRTA2
expression are difficult to predict at this stage. The
observation that IRTA2 has homology with CAM adhesion
receptors, together with its specific distribution in the
light zone of the GC, suggests that its ectopic expression
in centroblasts may cause a disruption in the GC
development and architecture. On the other hand, our
initial observations that IRTA2 can bind IgG immune
complexes comparably to bona fide Fc receptors suggest
that its inappropriate expression may perturb the dynamics
of cell surface regulation of B cell immunological
responses, possibly leading to clonal expansion.
Deregulated expression of FCGR2B as a result of the
t(1;14)(q21;q32) in follicular lymphoma has been proposed
to contribute to lymphomagenesis in this tumor type
(Callanan et al., 2000), by a mechanism involving escape
by tumor cells of anti-tumor immune surveillance through
their Fc binding and inactivation of tumor specific IgG.
Similar evasion mechanisms have been observed in cells
infected by Fc-encoding herpesviruses (Dubin et al.,
1991). The role of IRTA2 deregulation needs to be tested
in "gain of function" transgenic mice constitutively
expressing IRTA2 in the GC.

Third Series of Experiments

Chromosome 1q21 is frequently altered by translocations
and duplications in several types of B cell malignancy,
including multiple myeloma, Burkitt lymphoma, marginal
zone lymphomas, and follicular lymphoma. To identify the
genes involved in these aberrations, the chromosomal
breakpoint of a t(1;14)(q21;q32) in the myeloma cell line
FR4 was cloned. A 300 kb region spanning the
breakpoint contains at least five highly related adjacent
genes which encode surface receptor molecules that are
members of the immunoglobulin gene superfamily, and thus
called IRTA (Immunoglobulin Receptor Translocation
Associated). The various IRTA molecules have from three to
nine extracellular immunoglobulin superfamily domains and
are related to the Fc gamma receptors. They have
transmembrane and cytoplasmic domains containing ITIM-like
and ITAM-like (IRTA-1, IRTA-3, IRTA-4) signaling motifs.
In situ hybridization experiments show that all IRTA genes
are expressed in the B cell lineage with distinct
developmental stage-specific patterns: IRTA-1 is expressed
in a marginal zone B cell pattern. IRTA-2 is expressed in
centrocytes and more mature B cells. As a result of the
translocation in FR4, IRTA-1 is broken and produces a
fusion transcript with the immunoglobulin locus. The
IRTA-2 gene, normally silent in centroblasts, is
overexpressed in multiple myeloma and in Burkitt lymphoma
cell lines carrying 1q21 abnormalities. The data here
suggest that IRTA genes are novel B cell regulatory
molecules that may also have a role in lymphomagenesis.
What is claimed is:

1. An isolated nucleic acid molecule which encodes
immunoglobulin receptor, Immunoglobulin superfamily
Receptor Translocation Associated, IRTA, protein.

2. The isolated nucleic acid molecule of claim 1,
wherein the IRTA protein is IRTA1 protein comprising
the amino acid sequence set forth in Figure 18A (SEQ
ID NO:1).

3. The isolated nucleic acid molecule of claim 1,
wherein the IRTA protein is IRTA2 protein comprising
the amino acid sequence set forth in Figures 18B-1-
18B-3 (SEQ ID NO:3).

4. The isolated nucleic acid molecule of claim 1,
wherein the IRTA protein is IRTA3 protein comprising
the amino acid sequence set forth in Figures 18C-1-
18C-2 (SEQ ID NO:5).

5. The isolated nucleic acid molecule of claim 1,
wherein the IRTA protein is IRTA4 protein comprising
the amino acid sequence set forth in Figures 18D-1-
18D-2 (SEQ ID NO:7).

6. The isolated nucleic acid molecule of claim 1,
wherein the IRTA protein is IRTA5 protein comprising
the amino acid sequence set forth in Figures 18E-1-
18E-2 (SEQ ID NO:9).

7. An isolated nucleic acid molecule of claim 1, wherein
the nucleic acid molecule is DNA.

8. The isolated DNA molecule of claim 2, wherein the DNA 
is cDNA. 

9. The isolated DNA molecule of claim 2, wherein the DNA 
is genomic DNA. 

10. The isolated nucleic acid molecule of claim 1, 
wherein the nucleic acid molecule is an RNA molecule. 

11. The isolated DNA molecule of claim 2, wherein the DNA
molecule is cDNA having the nucleotide sequence set
forth in Figure 18A (SEQ ID NO:2).

12. The isolated DNA molecule of claim 2, wherein the DNA
molecule is cDNA having the nucleotide sequence set
forth in Figure 18A (SEQ ID NO:4).

13. The isolated DNA molecule of claim 2, wherein the DNA
molecule is cDNA having the nucleotide sequence set
forth in Figure 18A (SEQ ID NO:6).

14. The isolated DNA molecule of claim 2, wherein the DNA
molecule is cDNA having the nucleotide sequence set
forth in Figure 18A (SEQ ID NO:8).

15. The isolated DNA molecule of claim 2, wherein the DNA
molecule is cDNA having the nucleotide sequence set
forth in Figure 18A (SEQ ID NO:10).

16. The isolated nucleic acid molecule of claim 1,
wherein the nucleic acid molecule encodes a human
IRTA1 protein.



17. The isolated nucleic acid molecule of claim 1
operatively linked to a promoter of DNA
transcription.

18. The isolated nucleic acid molecule of claim 17,
wherein the promoter comprises a bacterial, yeast,
insect, plant or mammalian promoter.

19. A vector comprising the nucleic acid molecule of
claim 17.



20. The vector of claim 19, wherein the vector is a 
plasmid. 



21. A host cell comprising the vector of claim 20. 



22. The host cell of claim 21, wherein the cell is
selected from a group consisting of a bacterial cell,
a plant cell, an insect cell and a mammalian cell.

23. An isolated nucleic acid molecule comprising at least
15 contiguous nucleotides capable of specifically
hybridizing with a unique sequence included within
the sequence of the isolated nucleic acid molecule
encoding IRTA1 protein of claim 1.

24. The isolated nucleic acid molecule of claim 23
labeled with a detectable marker.

25. The nucleic acid molecule of claim 24, wherein the
detectable marker is selected from the group
consisting of a radioactive isotope, enzyme, dye,
biotin, a fluorescent label or a chemiluminescent
label.

26. A method for detecting a B cell malignancy or a type
of B cell malignancy in a sample from a subject
wherein the B cell malignancy comprises a 1q21
chromosomal rearrangement which comprises:

a) obtaining RNA from the sample from the subject;

b) contacting the RNA of step (a) with a nucleic
acid molecule of at least 15 contiguous
nucleotides capable of specifically hybridizing
with a unique sequence included within the
sequence of an isolated RNA encoding human IRTA
protein selected from the group consisting of
human IRTA1, IRTA2, IRTA3, IRTA4 and IRTA5,
under conditions permitting hybridization of the
RNA of step (a) with the nucleic acid molecule
capable of specifically hybridizing with a
unique sequence included within the sequence of
an isolated RNA encoding human IRTA protein,
wherein the nucleic acid molecule is labeled
with a detectable marker; and

c) detecting any hybridization in step (b), wherein
detection of hybridization indicates presence of
B cell malignancy or a type of B cell malignancy
in the sample.

27. The method of claim 26, wherein the detectable marker
is radioactive isotope, enzyme, dye, biotin, a
fluorescent label or a chemiluminescent label.

28. The method of claim 26, wherein the B cell malignancy
is selected from the group consisting of B cell
lymphoma, multiple myeloma, Burkitt's lymphoma,
marginal zone lymphoma, diffuse large cell lymphoma
and follicular lymphoma cells.

29. The method of claim 28, wherein the B cell lymphoma
is Mucosa-Associated-Lymphoid Tissue B cell lymphoma
(MALT).

30. The method of claim 28, wherein the B cell lymphoma
is non-Hodgkin's lymphoma.

31. An antisense oligonucleotide having a sequence
capable of specifically hybridizing to an mRNA
molecule encoding a human IRTA protein so as to
prevent overexpression of the mRNA molecule.

32. The antisense oligonucleotide of claim 31, wherein
the IRTA protein is selected from the group consisting
of human IRTA1, IRTA2, IRTA3, IRTA4 and IRTA5
protein.

33. A purified IRTA1 protein comprising the amino acid
sequence set forth in Figure 18A (SEQ ID NO:1).

34. The purified IRTA1 protein of claim 33, wherein the
IRTA1 protein is human IRTA1.
35. A purified IRTA2 protein comprising the amino acid
sequence set forth in Figures 18B-1-18B-3 (SEQ ID
NO:3).

36. The purified IRTA2 protein of claim 35, wherein the
IRTA2 protein is human IRTA2.

37. A purified IRTA3 protein comprising the amino acid
sequence set forth in Figures 18C-1-18C-2 (SEQ ID
NO:5).

38. The purified IRTA3 protein of claim 37, wherein the
IRTA3 protein is human IRTA3.

39. A purified IRTA4 protein comprising the amino acid
sequence set forth in Figures 18D-1-18D-2 (SEQ ID
NO:7).

40. The purified IRTA4 protein of claim 39, wherein the
IRTA4 protein is human IRTA4.

41. A purified IRTA5 protein comprising the amino acid
sequence set forth in Figures 18E-1-18E-2 (SEQ ID
NO:9).

42. The purified IRTA5 protein of claim 41, wherein the
IRTA5 protein is human IRTA5.

43. An antibody directed to a purified IRTA protein
selected from the group consisting of human IRTA1,
IRTA2, IRTA3, IRTA4 and IRTA5.
44. The antibody of claim 43, wherein the IRTA protein is
human IRTA protein.

45. The antibody of claim 43, wherein the antibody is a
monoclonal antibody or a polyclonal antibody.

46. The antibody of claim 43, wherein the monoclonal
antibody is a murine monoclonal antibody or a
humanized monoclonal antibody.

47. The antibody of claim 43, wherein the antibody is
conjugated to a therapeutic agent, wherein the
therapeutic agent is selected from the group
consisting of a radioisotope, a toxin, a toxoid, or a
chemotherapeutic agent.



48. A pharmaceutical composition comprising an amount of
the antibody of claim 43 effective to bind to cancer
cells expressing an IRTA protein selected from the
group consisting of human IRTA1, IRTA2, IRTA3, IRTA4
and IRTA5 so as to prevent growth of the cancer cells
and a pharmaceutically acceptable carrier.

49. The pharmaceutical composition of claim 48, wherein
the cancer cells are selected from the group
consisting of B cell lymphoma, a mantle cell lymphoma,
multiple myeloma, Burkitt's lymphoma, marginal zone
lymphoma, diffuse large cell lymphoma and follicular
lymphoma cells.

50. The pharmaceutical composition of claim 49, wherein
the B cell lymphoma cells are Mucosa-Associated-
Lymphoid Tissue B cell lymphoma (MALT) cells.

51. The pharmaceutical composition of claim 49, wherein
the B cell lymphoma cells are non-Hodgkin's lymphoma
cells.

52. A pharmaceutical composition comprising an amount of
the oligonucleotide of claim 31 effective to prevent
overexpression of a human IRTA protein and a
pharmaceutically acceptable carrier.

53. A method of diagnosing B cell malignancy which
comprises a 1q21 chromosomal rearrangement in a
sample from a subject which comprises:

a) obtaining the sample from the subject;

b) contacting the sample of step (a) with the
antibody of claim 43 capable of specifically
binding with a human IRTA protein selected from
the group consisting of human IRTA1, IRTA2,
IRTA3, IRTA4 and IRTA5 protein on a cell
surface of a cancer cell under conditions
permitting binding of the antibody with human
IRTA protein on the cell surface of the cancer
cell, wherein the antibody is labeled with a
detectable marker; and

c) detecting any binding in step (b), wherein
detection of binding indicates a diagnosis of B
cell malignancy in the sample.



WO 01/38490 PCT/US00/32403

54. The method of claim 53, wherein the B cell malignancy
is selected from the group consisting of B cell
lymphoma, multiple myeloma, Burkitt's lymphoma,
mantle cell lymphoma, marginal zone lymphoma, diffuse
large cell lymphoma and follicular lymphoma.

55. The method of claim 54, wherein the B cell lymphoma
is Mucosa-Associated-Lymphoid Tissue B cell lymphoma
(MALT).

56. The method of claim 54, wherein the B cell lymphoma
is non-Hodgkin's lymphoma.



57. A method of treating a subject having a B cell cancer
which comprises administering to the subject an
amount of anti-IRTA antibody effective to bind to
cancer cells expressing an IRTA protein selected from
the group consisting of human IRTA1, IRTA2, IRTA3,
IRTA4 and IRTA5 so as to prevent growth of the cancer
cells and a pharmaceutically acceptable carrier,
thereby treating the subject.

58. The method of claim 57, wherein the anti-IRTA
antibody is a monoclonal antibody.

59. The method of claim 58, wherein the monoclonal
antibody is a murine monoclonal antibody or a
humanized monoclonal antibody.

60. The method of claim 57, wherein the anti-IRTA
antibody is a polyclonal antibody.
61. The method of claim 57, wherein the B cell cancer is
selected from the group consisting of B cell
lymphoma, multiple myeloma, mantle cell lymphoma,
Burkitt's lymphoma, marginal zone lymphoma, diffuse
large cell lymphoma and follicular lymphoma.

62. The method of claim 61, wherein the B cell lymphoma
is Mucosa-Associated-Lymphoid Tissue B cell lymphoma
(MALT).

63. The method of claim 61, wherein the B cell lymphoma
is non-Hodgkin's lymphoma.

64. A method of treating a subject having a B cell cancer
which comprises administering to the subject an
amount of the oligonucleotide of claim 31 effective
to prevent overexpression of a human IRTA protein, so
as to arrest cell growth or induce cell death of
cancer cells expressing IRTA protein(s) and a
pharmaceutically acceptable carrier, thereby treating
the subject.

65. The method of claim 64, wherein the IRTA protein is 
selected from the group consisting of human IRTA1, 
IRTA2, IRTA3, IRTA4 and IRTA5 protein. 

66. The method of claim 64, wherein the B cell cancer is 
selected from the group consisting of B cell 
lymphoma, mantle cell lymphoma, multiple myeloma, 
Burkitt's lymphoma, marginal zone lymphoma, diffuse 
large cell lymphoma and follicular lymphoma. 
67. The method of claim 66, wherein the B cell lymphoma
is Mucosa-Associated-Lymphoid Tissue B cell lymphoma
(MALT).

68. The method of claim 66, wherein the B cell lymphoma
is non-Hodgkin's lymphoma.
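
Illustrative note (not part of the claims): claims 23 and 26 recite a nucleic acid molecule of at least 15 contiguous nucleotides capable of specifically hybridizing with an IRTA-encoding sequence. The following is a minimal Python sketch, under stated assumptions, of how candidate 15-nucleotide antisense probes could be enumerated from a target sequence. The function names `reverse_complement` and `candidate_probes` are hypothetical, the target fragment is the first 18 coding nucleotides shown in Figure 5, and the sketch does not check whether a probe is unique within a genome, which the claims require.

```python
# Hypothetical helper names; this only enumerates reverse-complement windows
# and does NOT test uniqueness or hybridization conditions.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def candidate_probes(target, length=15):
    """List every antisense probe of `length` nucleotides along `target`."""
    target = target.upper()
    return [
        reverse_complement(target[i:i + length])
        for i in range(len(target) - length + 1)
    ]


if __name__ == "__main__":
    # First 18 coding nucleotides of the Figure 5 cDNA (ATG CTG CTG TGG GCG TCC).
    fragment = "ATGCTGCTGTGGGCGTCC"
    for probe in candidate_probes(fragment):
        print(probe)
```

In practice each candidate would still have to be screened against other transcripts, including the other highly related IRTA genes, before it could be regarded as specific.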
FIGURE 5



1 CTCAATCAGCTTTATGCAGAGAAGAAGCTTACTGAGCTCACTGCTGGTGCTGGTGTAGGCAAGTGCTGCTTTGGCAA 

M L L W A S 

78 TCTGGGCTGACCTGGCTTGTCTCCTCAGAACTCCTTCTCCAACCCTGGAGCAGGCTTCCATGCTGCTGTGGGCGTCC 

LLAFAPVCGVQSAAAHKPVISVHPPWT 32 
155 TTGCTGGCCTTTGCTCCAGTCTGTGGACAATCTGCAGCTGCAC ACAAACCTGTG ATTTCCGTCCATCCTCCATGGAC 

TFFKGERVTLTCNGFQFYATEKTTWY58 
232 CACATTCTTCAAAGGAGAGAGAGTGACTCTGACTTGCAATGGATTTCAGTTCTATGCAACAGAGAAAACAACATGGT 

HRHYWGEKLTLTPGNTLEVRESGLY 83 
309 ATCATCGGCACTACTGGGGAGAAAAGTTGACCCTGACCCCAGG AAAC ACCCTCG AGGTTCGGGAATCTGGACTGT AC 

RCQARGSPRSNPVRLLFSSDS LI LQA109 
386 AGATGCCAGGCCCGGGGCTCCCCACGAAGTAACCCTGTGCGCTTGCTCTTTTCTTC AG ACTCCTT AATCCTGC AGGC 

PYSVFEGDTLVLRCHRRRKEKLTAVK 135 
463 ACCATATTCTGTGTTTG AAGGTGACACATTGGTTCTGAG ATGCC AC AG AAGAAGG AAAGAG AAATTGACTGCTGTG A 

YTWNGNILSISNKSWDL LIPQASSN 160 
540 AATATACTTGGAATGGAAACATTCTTTCCATTTCTAATAAAAGCTGGG ATCTTCTTATCCC ACAAGCAAGTTC AAAT 

NNGNYRCIGYGDENDVFRSNFKIIKI 186 
617 AACAATGGCAATTATCG ATGCATTGGATATGG AGATGAG AATGATGTATTTAGATCAAATTTC AAAATAATTAAAAT 

QELFPH PELXATDSQPTBGN SVNL SC212 
694 TCAAGAACTATTTCCACATCCAGAGCTGAAAGCTACAGACTCTCAGCCTACAG AGGGGAATTCTGTAAACCTGAGCT 

ETQLPPERSDTPLHFNFFRDGEVI L 237 
771 GTGAAACACAGCTTCCTCCAGAGCGGTCAG ACACCCCACTTCACTTCAACTTCTTCAG AG ATGGCGAGGTCATCCTG 

SDWSTY PE LQL PTVWRSNSG S YWCGA2 63 
848 TCAGACTGGAGCACGTACCCGGAACTCCAGCTCCCAACCGTCTGGAG AG AAAACTCAGGATCCTATTGGTGTGGTGC 

ETVRGNI H K H S PS LQIHVQR I PVSGV289 
925 TGAAACAGTGAGGGGTAACATCCACAAGCACAGTCCCTCGCTACAGATCCATGTGCAGCGG ATCCCTGTGTCTGGGG 

LLETQPSGGQAVEGEMLVLVCSVAE 314 
1002 TGCTCCTGGAGACCCAGCCCTCAGGGGGCCAGGCTGTTGAAGGGGAGATGCTGGTCCTTGTCTGCTCCGTGGCTGAA 

GTGDTTFSWHREDMQESLGRKTQRSL 340 
1079 GGCACAGGGGATACCACATTCTCCTGGCACCGAG AGGACATGCAGGAG AGTCTGGGG AGG AAAACTCAGCGTTCCCT 

RASLELPAIRQSHAGGYYCTADNS Y G 366 
1156 GAGAGCAGAGCTGGAGCTCCCTGCCATCAGACAGAGCCATGCAGGGGGATACTACTGTACAGCAGACAACAGCTACG 

P V Q S M V L N_ VTVRETPGNRDGLVAAG 391 
1233 GCCCTGTCCAGAGCATGGTGCTGAATGTC ACTGTGAGAGAGACCCCAGGCAACAG AG ATGGCCTTGTCGCCGCGGG A 

A T fifiLLfiAL^LAV ALL?HC W R R R K S G 417 
1310 GCCACTGG AGGGCTGCTCAGTGCTCTTCTCCTGGCTGTGGCCCTGCTGTTTCACTGCTGGCGTCGGAGG AAGTC AGG 

VGFLGDETRLPPAPGPGESSHSIC PA 443 
1387 AGTTGGTTTCTTGGGAGACGAAACCAGGCTCCCTCCCGCTCCAGGCCCAGGAGAGTCCTCCC ATTCCATCTGCCCTG 

Q V E L Q S L Y V D V HPKKGDLV Y„- S g I Q T 468 

1464 CCCAGGTGGAGCTTCAGTCGTTGTATGTTGATGTACACCCCAAAAAGGGAGATTTGGTATACTCTGAGATCCAGACT 

TQLGEEEEANTSRTLLEDKDVSVV Y § 494 

1541 ACTCAGCTGGGAGAAG AAG AGGAAGCTAATACCTCCAGG ACACTTCTAG AGG AT AAGG ATGTCTC AGTTGTCTACTC 

g V KTQHPDMSAGK I SSKDEES * 515 

1618 TGAGGTAAAGACACAACACCCAGATAACTCAGCTGGAAAGATCAGCTCTAAGGATG AAG AAAGTTAAGAG AATGAAA 
169 5 AGTTACGGGAACGTCCTACTCATGTG ATTTCTCCCTTGTCCAAAGTCCCAGGCCCAGTGCAGTCCTTGCGGC ACCTG 
1772 GAATGATCAACTC ATTCC AGCTTTCT AATTCTTCTC ATGC AT ATGC ATTC ACTCCC AGGAATACTC ATTCGTCTACT 
1849 CTGATGTTGGGATGGAATGGCCTCTGAAAGACTTCACTAAAATGACCAGGATCCACAGTTAAGAGAAGACCCTGTAG 
1926 TATTTG CTGTGGGCCTGACCTAATGC ATTCC CTAGGGTCTGCTTTAG AG AAGGGGG AT AAAG AG AG AGAAGG ACTGT 
2003 TATGAAAAACAGAAGCACAAATTTTGGTGAATTGGGATTTGCAGAGATGAAAAAGACTGGGTGACCTGGATCTCTGC 
2080 TTAATACATCTACAACCATTGTCTCACTGGAGACTCACTTGCATCAGTTTGTTT AACTGTG AGTGGCTGCAC AGGCA 
2157 CTGTGCAAACAATGAAAAGCCCCTTCACTTCTGCCTGCACAGCTTACACTGTCAGGATTCAGTTGCAGATTAAAGAA 
2234 CCCATCTGGAATGGTTTACAG AGAG AGGAATTTAAAAG AGGACATCAGAAGAGCTGG AG ATGCAAGCTCTAGGCTGC 
2311 GCTTCCAAAAGCAAATGATAATTATGTTAATGTCATTAGTGACAAAGATTTGCAACATTAGAGAAAAGAGACACAAA 
2 388 TATAAAATTAAAAACTTAAGTACCAACTCTCCAAAACTAAATTTGAACTTAAAATATTAGTATAAACTCATAMAAA 
2 465 CTCTGC CTTT AAAT AAAAAAAAAAAAAAAAAAAAA 
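
Illustrative note: the amino acid annotation printed above each nucleotide row in Figure 5 can be cross-checked by translating the cDNA with the standard genetic code. The following is a minimal Python sketch under stated assumptions; the function name `translate` is hypothetical, whitespace inside the sequence is discarded because the figure text contains spurious spaces, and any codon containing a character other than A, C, G or T is reported as `X` rather than guessed.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cdna, start=0):
    """Translate `cdna` from offset `start` until a stop codon or the end."""
    cdna = "".join(cdna.split()).upper()  # drop spaces introduced by text extraction
    protein = []
    for i in range(start, len(cdna) - 2, 3):
        residue = CODON_TABLE.get(cdna[i:i + 3], "X")  # X marks an unreadable codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # Last 18 nucleotides of the row starting at position 78 in Figure 5.
    print(translate("ATGCTGCTGTGGGCGTCC"))  # MLLWAS
```

The result agrees with the residues M L L W A S printed above that row of the figure.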
FIGURE 6a

1 CGGTGCAGTGTCCTGACTGTAAGATCAAGTCCAAACCTGTTTTGGAATTGAGGAAACTTCTCTTTTGATCTCAGCCCTTG 

MLLWVI LLVLAPVSG VQ FART ? R 22 
31 GTGGTCCAGGTCTTCATGCTGCTGTGGGTGATATTACTGGTCCTGGCTCCTGTCAG7GGACAGTTTGCAAGGACACCCAG 

PIIFLQPPWTT7F QG E RVTLTCKGFRF49 
161 GCCCATTATTTTCCTCCAGCCTCCATGGACCACAGTCTTCCAAGGAGAGAGAGTGACCCTCACTTGCAAGGGATTTCGCT 

YSPQKTKW YHRYLGKE ILRETPDN I L 7 5 
241 TCTACTCACCACAGAAAACAAAATGGTACCATCGGTACCTTGGGAAAG AAATACTAAGAGAAACCCCAGACAATATCCT? 

EVQESGEYRCQAQGS PLSSPVHLDF S S102 
321 GAGGTTCAGGAATCTGGAGAGTACAGATGCCAGGCCCAGGGCTCCCCTCTCAGTAGCCCTGTGCACTTGGATTTTTCTTC 

ASLILQAPLSVFEGDSVVLR'CRAKAE V129 
4 01 AGCTTCGCTGATCCTGCAAGCTCCACTTTCTGTGTTTGAAGGAGACTCTGTGGTTCTGAGGTGCCGGGCAAAGGCGGAAG 

TLNNTIYKNDNVLAFLNKRTDFH I ?K 155 

4 81 TAACACTGAATAATACTATTTACAAGAATGATAATGTCCTGGCATTCCTTAATAAAAGAACTGACTTCCATATTCCTCAT 

ACLKDNGAYRCTGYKSSCCPVSSNTVK 182 
561 GCATGTCTCAAGGACAATGGTGCATATCGCTGTACTGGATATAAGGAAAGTTGTTGCCCTGTTTCTTCCAATACAGTCAA 

I QVQEPFTRPVLRAS S FQ PZSG N PVT L209 
641 AATCCAAGTCCAAG AGCCATTT ACACGTCCAGTGCTGAGAGCC AGCTCCTTCC AGCCCATC AGCG GG AACCCAGTG ACCC 

TCETQLSLERSDVPLRFRFFRDDQTL 235 
7 21 TGACCTGTGAGACCCAGCTCTCTCTAGAGAGGTCAGATGTCCCGCTCCGGTTCCGCTTCTTCAGAGATGACCAGACCCTG 

GLGWSLS PNFQ ITAMWSK DSG FYWC KA 262 
301 GGATTAGGCTGGAGTCTCTCCCCGAATTTCCAGATTACTGCCATGTGGAGTAAAGATTCAGGGTTCTACTGGTGTAAGGC 

ATMPHSVI SDSPRSWIQVQIPASHP'V L289 

5 31 AGCAACAATGCCTCACAGCGTCATATCTG ACAGCCCGAGATCCTGGATACAGGTGCAGATCCCTGCATCTC ATCCTGTCC 

TLSPEKALNFSGTKVTLHCE-TQEDSL31S 
9 61 TCACTCTCAGCCCTGAAAAGGCTCTGAATTTTGAGGGAACCAAGGTGACACTTCACTGTGAAACCCAGGAAGATTCTCTG 

RTLYRFYHEGVPLRH K SVRCE R G A S I S 342 
1041 CGC AC TTTGT ACAGGTTTT ATC ATG AGGGTG T CC CC CTG AGG CACAAG T C AG TC CG CTGTG AAAGGG GAG C ATCC ATC AG 

F SLTTENSGNYYCTADNG L G A X P S K AV3 69 
1121 CTTCTCACTGACTACAGAGAATTCAGGGAACTACTACTGCACAGCTGACAATGGCCTTGGCGCCAAGCCCAGTAAGGCTG 

SLSVTVPVSHPVLNLSSPEDLIFEGA 395 
1201 TGAGCCTCTCAGTCACTGTTCCCGTG7CTCATCCTGTCCTCAACCTC AGCTCTCCTG AGG ACCTG ATTTTTG AGGG AGCC 

KVTLHCEAQRGSLPI LYQFHHEDAA LS 422 
1281 AAGGTG ACACTTCACTGTG AAGCCCAG AG AGGTTCACTCCCCATCCTGT ACC AGTTTCATCATG AGG ATGCTGCCCTGG A 

RRSANSAGGVA I S F S L TAEHSGNY Y C T449 
1361 GCGTAGGTCGGCCAACTCTGCAGGAGGAGTGGCCATCAGCTTCTCTCTGACTGCAGAGCATTCAGGGAACTACTACTGCA 

ADMGFGPQRS KAVSLSITVPVSHPVL4"7 5 
1441 CAGCTGACAATGGCTTTGGCCCCCAGCGCAGTAAGGCGGTGAGCCTCTCCATCACTGTCCCTGTGTCTCATCCTGTCCTC 

T LSSAEAL.T FSGATVT LHCEVQRG S ?Q 502 
1521 ACCCTCAGCTCTGCTG AGGCCCTGACTTTTGAAGGAGCCACTGTGACACTTCACTGTGAAGTCC AGAGAGGTTCCCC ACA 

I LYQFYHEDMPLWSSSTPSVGRVS F S F529 
1601 AATCCTATACCAGTTTTATCATGAGGACATGCCCCTGTGGAGCAGCTCAACACCCTCTGTGGGAAGAGTGTCCTTCAGCT 

SLTEGHSGNYYCTADNGFGPQRS E V V 555 

1 6 S 1 TCTCTCTGACTGAAGGACATTCAGGGAATTACTACTGCACAGCTGACAATGGCTTTGGTCCCCAGCGCAGTGAAGTGGTG 

S L FVTV P V S R P I L T L. R V ? RAQAVVG D L5S2 

17 61 AGCCTTTTTGTCACTGTTCCAGTGTCTCGCCCCATCCTCACCCTCAGGGTTCCC AGGGCCCAGGCTGTGGTGGGGGACCT 

L ELHCEA PRGS P? I I, YWFYHEDVT LG S609 
1341 GCTGGAGCTTCACTGTGAGGCCCCGAGAGGCTCTCCCCCAATCCTGTACTGGTTTTATCATGAGGATGTCACCCTGGGGA 

SSAPSGGEASFNL S ITAE HSGN YS CS 635 
1921 GCAGCTCAGCCCCCTCTGGAGGAGAAGCTTCT7TCAACCTCTCTCTG ACTGCAGAACATTCTGGAAACTACTCATGTGAG 

ANNGLVAQHSDTISLSVIVPVSRPI L T 662 
2001 GCCAACAATGGCCTAGTGGCCCAGCACAGTGACACAATATCACTCAGTGTTATAGTTCCAGTATCTCGTCCCATCCTCAC 

FR APRAQAVVGDLLSLHCEALRGS S P 1689 
2081 CTTCAGGGCTCCCAGGGCCCAGGCTGTGGTGGGGGACCTGCTGGAGCTTCACTGTGAGGCCCTGAGAGGCTCCTCCCCAA 

LYWFYHEDVT LGK I S A P S_ G G G A S FN L 715 
2161 TCCTGTACTGGTTTTATCATGAAGATGTCACCCTGGGTAAGATCTCAGCCCCCTCTGGAGGAGGGGCCTCCTTCAACCTC 

S LTTEHSG I YSCEADNGLEAQRS EMV T 742 
2241 TCTCTGACTACAGAACATTCTGGAATCTACTCCTGTG AGGCAGACAATGGTCTGGAGGCCCAGCGCAGTG AGATGGTG AC 

LKVAGEWALPTSSTSSN* 7 59 

2321 ACTGAAAGTTGCAGGTGAGTGGGCCCTGCCCACCAGCAGCACATCTGAGAACTG ACTGTGCCTGTTCTCCCTGCAGCTGA 
2401 AAATGGAGCCACAGAGCTCCTCAGGGCTGTTTGCTTGTGTGGCATCCCAGCACACTTCCTGCCTGCAGAACCTCCCTGTG 
2481 AAAGTCTCGGATCCTTTGTGGTATGGTTCCAGGAATCTGATGTTTCCCAGCAGTCTTCTTGAAG ATGATCAAAGCACCTC 
2 561 ACTAAAAATGCAAATAAGACTTTTTTAGAACATAAACTATATTCTGAACTGAAATTATTACATGAAAATGAAACCAAAGA 
2641 ATTCTGAGCATATGTTTCTCTGCCGTAG AAAGGATTAAGCTGTTTCTTGTCCGG ATTCTTCTCTCATTG ACTTCTAAGAA 
2721 GCCTCTACTCTTGAGTCTCTTTCATTACTGGGGATGTAAATGTTCCTTACATTTCCAC ATTAAAA ATCCTATGTTAACGA 
AAAAA 
FIGURE 6b

CGGTGCAGTGTCCTGACTGTMGATCAAGTCCAAACCTGTTTTGGAATTGAGGAAACT^CTTTTGATCTCAGCCCTTG 

MLLWVILL7LAFVS gTq F A R T ? R 

GTGGTCCAGGTCTTCATOCTGCTGTGGGTGATATTACTCGTCCTGGCTCCTGTCAGTGGACAGTTTGCAAGGACACCCAG 
?II? kQP?WTTVFQGERVTL7CKGFR =" 
GCCCATTATTTTCCTCCAGCCTCCATGGACCACAGTCTTCCAAGGAGAGAGAGTGACCCTCACTTGCAAGGGATT^CGCT 
^SPQ.KTKWYHRYLGKE I L R E T P DM I* L 

TCTACTCACCACAGAAAACAAAATGGTACCATCGGTACCTTGGGAAAGAAATACTAAGAGAAACCCCAGACAATATCC-T 
£VQESGEYRCQAOGS?LSSPVHLDFSS 
GAGGTTCAGGAATCTGGAGAG7ACAGATGCCAGGCCCAGGGCTCCCCTCTCAGTAGCCCTGTGCACTTGGATTTT T CT' r C 
ASLILQAPLSVFEGDSVVLRCRAKAEV 
AGCTTCGCTGATCCTGCAAGCTCCACTTTCTGTGTTTGAAGGAGACTCTGTGGT7CTGAGGTGCCGGGCAAAGGCGGAAG 
TLNNTIYKNDNVLAFLNKRTDFHI PH 

TAACACTGAATAATACTATTTACAAGAATGATAATGTCCTGGCATTCCTTAATAAAAGAACTGACTTCCATATTCCTCAT 

ACLKDN6AYRCTGYKE5CCPVSSMTVK 
GCATGTCTCAAGGACAATGGTGCATATCGCTGTACTGGATATAAGGAAAGTTGTTGCCCTGTTTCTTCCA^ 

XQVQEPFTRPVLRASSFQPISGNPVTL 
AATCCAAGTCCAAGAGCCATTTACACGTCCAG7GCTGAGAGCCAGCTCCTTCCAGCCCATCAGCGGGAACCCAGTGACCC 

TCETQLSLERSDVPLRFRFFRDOQTL 
TGACCTGTGAGACCCAGCTCTCTCTAGAGAGGTCAGATG7CCCGCTCCGGTTCCGCTTCTTCAGAGATGACCAGACCCTG 
GLGMSLSPNFQI TAMWSKDSGFYWCKA 
GG ATTAGGCTGGAGTCTCTCCCCGAATTTCC AG ATTAC7GCCATGTGGAGTAAAGATTCAGGGTTCTACTGGTGTAAGGC 

ATMPHSV1SDSPRSWIQVQIPASHPVL 
AGCAACAATGCCTCACAGCGTCATATCTGACAGCCCGAGATCCTGGATACAGGTGCAGATCCCTGCATCTCATCC' r G T CC 

?LSPSKALNFEGtXVTLHCETQEDS*L 
TCACTCTCAGCCCTGAAAAGGCTCTGAATTTTGAGGGAACCAAGG7GACACTTCACTGTGAAACCCAGGAAGATTC' r CTG 
STLYRFYHEGVPLRHKSVRCERGASIS 

CGCACTTTG7ACAGG7TT7ATCA7GAGGG7GTCCCCC7GAGGCACAAGTCAGTCCGC7G7GAAAGGGGAGCATCCATCAG 
?SL7TENSGNYYCTADNGLGAK P S K A V 

C7TC7CAC7GAC7ACAGAGAA77CAGGGAAC7AC7AC7GCACAGC7GACAATGGCC7TGGCGCCAAGCCCAGTAAGGCTG 

SLSVTVPVSHFV.NLSSPEDLX FEGA 
7GAGCC7C7CAGTCAC7GTTCCCGTGTC7CATCC7G7CCTCAACC7CAGCTCTCCTGAGGACC7GATTTTTGAGGGAGCC 
KVTLHCEAQRGS-^PILYQFHHEDAALE 
AAGG7GACACTTCAC7G7CAAGCCCAGAGAGGT7CAC7CCCCA7CC7G7ACCAG77TCATCATGAGGA7GC7GCCC7GGA 
RRSANSAGGVAISFSL7AEHSGNYYC7 
GCG7AGG7CGGCCAAC7C7GCAGGAGGAG7GGCCA7CAGCT7CTCTC7GACTGCAGAGCA7TCAGGGAAC7ACTACTGCA 
ADNGFGPQRSKAV SLS I7VPVS HPVL 

CAGC7GACAA7GGC77TGGCCCCCAGCGCAG7AAGGCGG7GAGCC7CTCCATCACTGTCCCTG7G7CTCA7CCTG7 , CCTC 

TLSSAEAL7FEGA7V7LHCE VQRGSPQ 

ACCC7CAGC7C7GC7GAGGCCCTGAC777TGAAGGAGCCACTG7GACAC7TCACTG7GAAGTCCAGAGAGG7TCCCCACA 

ILYQFYHEDMPL WSSS7PSVGRVSFSF 

^JVTCCTATACCAGTT7TATCATGAGGACATGCCCCTG7GGAGCAGC7CAACACCC7C7G7GGGAAGAG7G7CCT7CAGCT 

SLTEGHSGNYYCTADNGFGPQRSEVV 

7C7C7C7GAC7GAAGGACAT7CAGGGAATTAC7AC7GCACAGC7GACAA7GGCTTTGGTCCCCAGCGCAG7GAAG7GG7G 

SL F VTCKCWVLASHPPfcAEFSLTHSFK 

AGCCTTT?TG7CAC7GG7AAG7GC7GGGT7CT^GCCAG7CACCCACCCC7GGCTGAG77CTC7C7CACCCA7TCC7T7AA 
NLFALSSFL?* scop 

AAAJCTGTTTGCACTGTCCAGTTTCCTCCCCTAATCAACTT^ 

:.7T^CCG7AC7CA7AAG7CC7GGC7GAGCCAGACCCr^AAAACAGC7CAG7AGA7TCCCCAGC77T7ACCAAA7GAAT7 

7A7T7A77G7A7777CTCC7CA77CC77G7A7G77CCAACAG7ACGCCAATT7T7CTTGA7GCACGGAGCG7G7CCTACT 

7C7C7AC7GACA777ACA7A77AAC7TAGC7ACAAGCACAG7CTTA7AGA7AAA7A77C-G7CAAGACC77AAA77CTCCA 

AAGGAT77CCAA7CTTA7GG7AGATTTGGAGAAAGC7GC7^G7GAACAAAGGGGGAAA7GGCTCCCTAGGAACCAAC7CC 

7CAAAC77CTGGAGT7TTTATGA7CCCTTG77TTC7AACC7GC7AAAA7CAG7A7CA7TTTAT7GTA77A7T7TAA 

AC7A77G77GAAG7A7GACATACATTCAAGAAACG7G7GCAAATTG7ATGTGTACGA7TTGGTGTCTTTTTAGGAGC7AA 
GTTGCrrCTGTrmACTTGAATCTTTGTTTATAGAAACTGGG^ 

7GA7AGAAAAATCTTGAGCC7GATGTG7CAGACA7GCCCCTAGCA7AAC7TG7TGAGTAAAGAGGTTATTTTTAAAATGT 

GAATC T7CT GAGAC7AC7CCAAAGTCAGAGCCAAA7CTACTAGGAAGC7TCTAGAC7TCA 

7A7CTTTT7ATCCA7GTTTTACTTI\rT7C7CA7AT7CAGCAGCA7C7TAAGC 

CCCTT AATGCC AG7 AG AA7GTAAGCTTC A7G AG AAC AGAAC7GC ATCCATCTTGG 7C7TC AC AAC ATCCCTGTGCCT ACT 
CAGTG7T7GGCACACAGTAGGTCCTCAGTCAACAT7TG7AATTTAGTGGACAGATGATATGACAAGATGATAAGAGGGGA 
77TAAAAAAATCATCT AGCA AAGCCCAAGAGGAAAAAAAACAAAGCTATTTTAGAAATGAAATA 

AGAATAGATTGGA7ATCTTTGAAAACCAT7AATTGAATGAAGAACCAATTTGAGAAAACAATACAGAATC 

AGATACAGAAATAAAGGCAAAAGTTATAATATGGAAA7CAGACAA7GGATTTGTCTGTATCCAGTTATGTGGATAATTAA 

AATGG AG ACCCTC AGAAAATTGAACCG AAG AGT AAAA7G AAACTCAAAAATGTAGTAG AAATTG TTG GG AAGTAAAGAAA 

ACTTGAATATGTAGATCAGAACATATATGTTGATGACGTTATTGACTTTGAGGTTAAAAATATATATATGTGCCTATGAT 

TATGGGGAAAAAAGCAGTCGTCTCAGAAAGAAAAACATCAAGTTAGTCTTAGACTTTGCAGTGCACTCAGTACCAAAG 

AGAGGAGGCCAGACITGGACCTGCGAGGGAAGAATAA7AACCGAAAATTTTATATCAATTCAAAAAGACATTGTCAA 

7ACAGGGATTC AGGAAACTG AG AATGCACTAAGCC77 C7GG AAAAAAC ACCTAATG ACAAAATCTAG CCCAACAAG ATGT 

AAATGAATATAAAGGACTCATAA7GAGGAAACCGCArrATGACTGGCTCTCAACCC7GGCCGCATATTAGACTCGTCAAA 

GACCTITGTAAAAGGTCACACATTGACTCGTCAAAGCCCCTCTCCAGACTAATTCAAT7CAGAATC7CACAGATGGGGCC 

ACAGAATCAGTATTTTTTGACACAACCTCAAGTGAGAATATTGTGTAGACAAGATTGGAAACCACTGATrTAGATAT 

AACAAAGGCTAATCAACTGTGAGAATTATGGTCACAGAATAGAAAGTAACTATTATG 

G7AACAAAGAAAAATAG7TAGAGGAAGGAGAGGAAG7AAAGGAACAATCATTTTCTCATGATTATTATTATTTCAGAGTA 
AATTGTG AGTTA77TC ACAATTC AAAAAG AATG G ACT G7TTTAAAAAATT AGTAAT AG ATTTCAAAATGTCCATTTTGT A 

AATCG777C7GAATACITTGTCAACAG7TACTCA7CATTAATGGC77A7ACTTCACT 

G7AGCC7G7AGAGTCACATAGGAGAGAACAAG7GAAT7CTTTGGG7GGCGCAAGCATAGATGT7AGCAC7GACAAAAAAA 

AATAA7AAAAATAAACC7GTGCA?TGATATGATCACAAATCATCAGGGAAAGAGGAA^ 

TTACAAG7GTAAATTGGTTCAACC7T77CG7mAAT7GACAC^TO 

ATTTTGATA7ACATACATGGTA7A7AACGATCAAATTAGGATAT7TAATG7ACCCATCATCTCATGCATT^ 
7TGG AA7AAA AACATTCAAA AGCCAAAAAA AAAAAAAAAA .AAAAAAAA 
FIGURE 6c-1

i CGG7GCAG7G7CC7GAC7G7AAGA7CAAG7CCAAACC7G7777GGAA77GAGGAAAC77£7C7777GA7C7CAGCCC77G 

M L L W V I L L V L A ? V S G » Q F A R T ? R 22 
81 GTGGTCCAGGTCTTCATCCTGCTGTGGGTGA7ATTACTGG7CCTGGCTCCTGTCAGTGGACAGTTTGCAAGGACACCCAG 

PIIFLQPPWTTVFQGERVTLTCXGFRF49 
161 GCCCA77A7777CC7CCAGCC7CCA7GGACCACAG7C77CCAAGGAGAGAGAC7GACCC7CAC77GCAAGGGA777CGC7 

YSPQKTKWYKRY1GK2ILRE7PDN: L 75 
241 7CTAC7CACCACAGAAAACAAAATGG7ACCA7CGG7ACC77GGGAAAGAAA7ACTAAGAGAAACCCCAGACAA7A7CC77 

SVQESGEYRCQAQGSPLSSPVKLDFSS 102 
321 GAGG7TCAGGAATC7GGAGAG7ACAGA7GCCAGGCCCAGGGC7CCCC7C7CAG7AGCCC7G7GCAC77GGA77T77C77C 

ASLI LQAPLSVFEGDSVVLRCRAXA E V 129 
h 0 1 . AGCTTCGCTGATCC7GCAAGC7CCACTTTC7G7G7T7GAAGGAGAC7C7G7GGTTCTGAGG7GCCGGGCAAAGGCGGAAG 

TLMNTIYKNDNVLAFLNKRTDFHI ?H 155 
461 7AACACTGAATAA7ACTAT77ACAAGAA7G A7AA7GTCCTGGCATTCCTTAA7AAAAGAAC7GACTTCCA7A7TCCTCA7 

ACLKDNGAYRCTGYKESCCPVSSNTVK 182 
561 GCATGTCTCAAGGACAATGG7GCATATCGC7GTACTGGA7A7AAGGAAAGTTGTTGCCCTG7TTCTTCCAATACAGTCAA 

I Q V Q E P F T R P V L R A S S F Q P I S G N P V T L 209 
641 AATCCAAGTCCAAGAGCCA77TACACGTCCAG7GC7GAGAGCCAGC7CCTTCCAGCCCATCAGCGGGAACCCAG7GACCC 

TCETQLSLERSDVPLRFRFFRDDQT L 235 
721 TGACCTGTGAGACCCAGCTCTCTCTAGAGAGG7CAGATGTCCCGCTCCGGT7CCGC7TCTTCAGAGATGACCAGACCCTG 

GLGWSLSPNFQITAMWSXDSGFYWCKA 262 
3 0 1 GGATTAGGC7GG AGTC7CTCCCCG AATTTCCAGA77ACTGCCA7G7GG AG7AAAGATTCAGGG7TC7ACTGG7G7AAGGC 

ATHPHSVISDS'PRSWZQVQIPASHPVL 289 
S 8 1 AGCAACAATGCCTCACAGCG7CATATC7G ACAGCCCGAGA7CC7GGATACAGG7GCAGA7CCCTGCA7C7CATCC7G7CC 

TLSPEXALNFSGTKVTLHCSTQEDSL 315 
961 7CACTCTCAGCCCTGAAAAGGC7CTGAAT77TGAGGGAACCAAGG7GACAC7TCACTG7GAAACCCAGGAAGA77CTCTG 

RTLYRFYHEGVPLRHKSVRCERGASIS 342 
1041 CGCACTTTGTACAGGTTTT ATCATG AGGGTG7CCCCC7G AGGCACAAG7C AGTCCGC7G7G AAAGGGGAGCA7CCATCAG 

FS LTTENSGNYYC7A2NGLGAKPSKAV369 
1121 CTTCTCACTG ACTACAGAG AATTC AGGGAAC7ACTACTGCACAGCTG ACAATGGCCT7GGCGCCAAGCCC AGTAAGGCTG 

SLSVTVPVSHPVLNLSSPE3LXFSGA 395 
1201 TGAGCCTCTCAGTCACTG77CCCGTG7C7CA7CC7GTCC7CAACC7CAGCTCTCC7GAGGACCTGAT7T7TGAGGGAGCC 

KVTLHCEAQRGSLPILYQFHHEDAALE 422 
1281 AAGGTGACACTTCACTGTG AAGCCCAGAG AGG77CAC7CCCCATCC7G7ACCAGTTTCATCATG AGGATGCTGCCCTGGA 

RRSANSAGGVAISFSL7AEHSGNYYCT 449 
1361 GCGTAGGTCGGCCAACTC7GCAGG AGGAGTGGCCA7CAGC77C7C7C7GAC7GCAGAGC A77CAGGGAAC7AC7ACTGCA 

ADNGFGPQRSKAVSLS ZTVPVSHPVL 47 5 
1441 CAGCTGACAA7GGCT7TGGCCCCCAGCGCAG7AAGGCGGTGAGCC7C7CCATCACTG7CCC7GTGTC7CATCCTGTCCTC 

TLSSAEALTFEGATVTLHCSVQRGS ?Q 502 
1521 ACCCTCAGCTCTGCTGAGGCCCTGACTTTTGAAGGAGCCAC7G7GACACT7CACTG7GAAG7CCAGAGAGG7TCCCCACA 

ILYQFYKEOMPLWSSSTPSVGRVSFSF 529 
1601 AATCCTATACCAGTTTTA7CA7GAGGACATGCCCC7G7GGAGCAGC7CAACACCC7C7G7GGGAAGAGTG7CC77CAGC7 

SLTEGKSGNYYCTADMGFG P Q R S E V V 555 
1681 TCTCTCTGACTGAAGGACA77CAGGGAA77ACTAC7GCACAGC7GACAATGGC777GG7CCCCAGCGCAGTG AAGTGGTG 

SLFVTV PVSR ?I LTLRV PRAQAVVGDL 5 82 
1 7 6 i AGCCTTTTTGTCACTG7TCCAG7GTC7CGCCCCA7CC7CACCC7CAGGG7TCCCAGGGCCCAGGCTG7GG7GGGGGACC7 

LELHCEAPRGSPPILYWFYHSOVT.GS 609 
1 S 4 1 GCTGGAGC7TCAC7G7GAGGCCCCGAGAGGC7C7CCCCCAA7CC7G7AC7GG7777A7CA7G AGG A7G7CACCC7GGGGA 

SSA?SGGEASFnL3L7AEKSG!*YSCE 63 5 
1921 GCAGCTCAGCCCCCTC7GGAGGAG AAGCTTC777CAACCTCTCTC7G AC7GCAGAACAT7CTGG AAACTACTCA7GTGAG 

ANNGLVAQHSD7ISLSVIV?VSRP:-7 662 
2001 GCCAACAATGGCC7AG7GGCCCAGCACAG7G ACACAA7A7CAC7CAG7G77A7AG77CCAGTA7C7CG7CCC A7CC7C AC 

FRAPRAQAVVGOLL E1HCSAL.RGS S PI 689 
2081 CTTCAGGGCTCCCAGGGCCCAGGC7G7GG7GGGGGACC7GC7GG AGC77CAC7G7GAGGCCC7G AGAGGCTCC7CCCCAA 

LYWFYHEDV7LGKI S A ? S a> G G G AS F N L 715 
2161 7CCTG7AC7GG7777A7CA7GAAGA7G7CACCC7GGG7AAGA7C7CAGCCCCC7C7GGAGGAGGGGCC7CC77CAACC7C 

SLTTEHSGIYSCEAOXGLEAQRSEMVT "42 
2241 7CTCTGACTACAG AACA77C7GGAA7C7AC7CC7G7GAGGCAGACAA7GG7C7GGAGGCCCAGCGCAG7GAGA7GG7GAC 

LXVAVPVSRPVLTLRAPGTHAAVGDLL 769 
2321 ACTGAAAG77GCAG77CCGG7G7C7CGCCCGG7CC7CACCC7CAGGGC7CCCGGGACCCA7GCTGCGGTGGGGGACC7GC 

ELKCEALRGSPLI LYRFFHSDVTLGN 795 
2401 7GGAGC7TCAC7G7G AGGCCC7G AGAGGC7C7CCCC7GA7CC7G7ACCGG777777CA7G AGGA7G7CACCC7AGGAAAT 

RSSPSGGASLNLSLTAEKSGNYSCZAD e22 
2481 AGGTCG7CCCCC7C7GGAGGAGCG7CC77AAACC7C7C7C7G AC7GCAG AGCAC7C7GGAAACTACTCC7G7GAGGCCGA 

NGLGAQRSETV7LY:7GLTANRSG?FA849 
2 561 CAATGGCCTCGGGGCCCAGCGCAG7G AG ACAG7G AC ACTTTATATCACAGGGCTG ACCGCGAAC AG AAG7GGCCCT7T7G 

T G V A g Ct L T, S ? A G L A A G A L T. v C W L S R 875 

2641 CCACAGGAG7CGCCGGGGGCC7GC7C AGCA7AGCAGGCC77GC7GCGGGGGCAC7GC7GC7C7AC7GC7GGC7C7CGAGA 

KAGRKPASDPAR SPSDSDSQSPT Y H M V 902 
2721 AAAGCAGGGAGAAAGCC7GCC7C7GACCCCGCCAGGAGCCC77CAGAC7CGGAC7CCCAAGAGCCCACC7A7CACAA7GT 

PAWEELQPVY7NANPRGENVV Y S Z V R I 929 
:S01 ACCAGCCTGGG AAGAGC7GCAACC AG7G7 AC AC7AA7GCAAA7CC7 AGAGG AG AAAA7G7GG777 ACTC AGAAG7 ACGG A 

IQEKKKKAVASDPRKLRNKGSPi: Y S 955 

2881 7CATCCAAGAGAAAAAGAAAC A7GCAG7GGCC7C7GACCCCAGGCA7C7CAGGAACAAGGG77CCCC7A7CA7C7ACTC7 
FIGURE 6c-2



VAS TPVS GS LFLASSAPHR 



stop 



GAAGTTAAGGTGGCGTCAACCCCGGTTTCCGGATCCCTGTTCTTGGCTTCCTCAGCTCCTCACAGATGAGTCCACACGTC 
TCTCCAACTGCTGTTTCAGCCTCTGCACCCCAAAGTTCCCCTTGGGGGAGAAGCAGCATTGAAGTGGGAAGATTTAGGC7 
GCCCCAGACCATATCTACTGGCCTTTGTTTCACATGTCCTCATTCTCAGTCTGACCAGAATGCAGGGCCCTGCTGGACTG 
TCACCTGTTTCCCAGTTAAAGCCCTGACTGGCAGGTTTTTTAATCCAGTGGCAAGGTGC 

ATCTCCTGGATTCCTTAGTGGGCTTCAGCTGTGGTTGCTGTTCTGAG7ACTGCTCTCATCACACCCCCACAGAGGGGGTC 
TTACCACACAAAGGGAGAGTGGGCCTTCAGGAGATGCCGGGCTGGCCTAACAGCTCAGGTGCTCCTAAACTCCGACACAG 
AGTTCCTGCTTTGGGTGGATGCATTTCTCAATTGTCATCAGCCTGGTGGGGCTAC 

CACACAGCCTGTGCACATGGGACATGTGATGGGTCTCCCCACGGGGGCTGCATTTCACACTCCTCCACCTGTCTCAAACT
CTAAGGTCGGCACTTGACACCAAGGTAACTTCTCTCCTGCTCATGTGTCAGTGTCTACCTGCCCAAGTAAGTGGCTTTCA 
TACACCAAGTCCCGAAGTTCTTCCCATCCTAACAGAAGTAACCCAGCAAGTCAAGGCCAGGAGGACCAGGGGTGCAGACA 
GAACACATACTGGAACACAGGAGGTGCTCAATTACTATTTGACTGACTGACTGAATGAATGAATGAATGAGGAAGAAAAC
TGTGGGTAATCAAACTGGCATAAAATCCAGTGCACTCCCTAGGAAATCCGGGAGGTATTCTGGCTTCCTAAGAAACAACG

GAAGAGAAGGAGCTTGGATGAAGAAACTGTTCAGCAAGAAGAAGGGCTTCTTCACACTTTTATGTGCTTGTGGATCACCT
GAGGATCTGTGAAAATACAGATACTGATTCAGTGGGTCTGTGTAGAGCCTGAGACTGCCATTCTAACATGTTCCCAGGGG
ATGCTGATGCTGCTGGCCCTGGGACTGCACTGCATGCATGTGAAGCCCTATAGGTCTCAGCAGAGGCCCATGGAGAGGGA 
ATGTGTGGCTCTGGCTGCCCAGGGCCCAACTCGGTTCACACGGATCGTGCTGCTCCCTGGCCAGCCTTO 

CACCAGCTGCTGTTGCTGAGAGAGCTTCTTCTCTGTGACATGTTGGCTTTCATCAGCCACCCTGGGAAGCGGAAAGTAGC 
TGCCACTATCTTTGTTTCCCCACCTCAGGCCTCAGACTTTCCCATGAA 

ATTCAGAGTTGTTCTCCCATCTCTGAGCAATGGGATGTTCTGTTCCGCTTTTATGATATCCATCACATCTTATCTTGATC
TTTGCTCCCAGTGGATTGTACAGTGATGACTTTTAAGCCCCACGGCCCT 

TC^CTCCACCTGAACCATGGCTTTTCATGCTTCCAAGTGTCAGGGCCTTGCCCAGATAGACAGGGCTGACTCT 
CAACCTTTCAAGGAGGAAACCAGACACCTGAGACAGGAGCCTGTATGCAGCCCAGTGCAGCCTTGC^ 
GAGGCATTTGTCATCACTACAGATATGCAACTAAAATAGACGTGGAGCAAGAGAAATGCATTCCC^ 
TTAGGCCTAGTTGAAAGTCAAGAAGGACAGCAGCAAGCATAGGCTCAC^ATT^ 

CTGGAGGTCACATCACCAACAAAGCTCACGCCCTATGCAGTTCTGAGAAGGTGGAGGCACCAGGCTCAAA^ 
TAGAATTTCTCATTGGGAGAGTAAGGTACCCCCATCCCAGAATGATAACTGCACAGTGGCAGAACAAACT 
GTGGGTGGACCCCATCCAGTCTGTTGAAGGCCTGAATGTAACAAAAGGGCTTATTCTTC 
GCTTTGGGCTGGGACATAAGTTTTTCTGCTTTCAGACGCAAACTGAA

ATATGGACTGAAAGAAACTATGCTATTGGATCTCCTGGATCTCCAGCTTGCTGACTGCAGATCTTGAGATATGTCAGCCT 
CTACAGTCACAAGAGCTAATTCATTCTAATAAACCAATCTTTC 



WO 01/38490

PCT/US00/32403

FIGURE 8A 
FIGURE 18A



1 CTCAATCAGCTTTATGCAGAGAAGAAGCTTACTGAGCTCACTGCTGGTGCTGGTGTAGGCAAGTGCTGCTTTGGCAA 

M L L W A S 6 
78 TCTGGGCTGACCTGGCTTGTCTCCTCAGAACTCCTTCTCCAACCCTGGAGCAGGCTTCCATGCTGCTGTGGGCGTCC 

LLAFAPVCGQSAAAHKPVISVHPPWT 32
155 TTGCTGGCCTTTGCTCCAGTCTGTGGACAATCTGCAGCTGCACACAAACCTGTGATTTCCGTCCATCCTCCATGGAC 

TFFKGERVTLTCNGFQFYATEKTTWY 58
232 CACATTCTTCAAAGGAGAGAGAGTGACTCTGACTTGCAATGGATTTCAGTTCTATGCAACAGAGAAAACAACATGGT

HRHYWGEKLTLTPGNTLEVRESGLY 83
309 ATCATCGGCACTACTGGGGAGAAAAGTTGACCCTGACCCCAGGAAACACCCTCGAGGTTCGGGAATCTGGACTGTAC

RCQARGSPRSNPVRLLFSSDSLILQA 109
386 AGATGCCAGGCCCGGGGCTCCCCACGAAGTAACCCTGTGCGCTTGCTCTTTTCTTCAGACTCCTTAATCCTGCAGGC

PYSVFEGDTLVLRCHRRRKEKLTAVK 135
463 ACCATATTCTGTGTTTG^^^gACATTGGTTCTGAGATGCCACAGAAGAAGGAAAGAGAAATTGACTGCTGTGA 

YTWNGNILSISNKSWDLLIPQASSN 160
540 AATATACTTGGAATGGAAACATTCTTTCCATTTCTAATAAAAGCTGGGATCTTCTTATCCCACAAGCAAGTTCAAAT

NNGNYRCIGYGDENDVFRSNFKIIKI 186 
617 AACAATGGCAATTATCGATGCATTGGATATGGAGATGAGAATGATGTATTTAGATCAAATTTCAAAA^AA^AA^IT 

QELFPH PELKATDSQPTEGNSV |i§|^||il C 21 2 
694 TCAAGAACTATTTCCACATCCAGAGCTGAAAGCTACAGACTCTCAGCCT 

ETQLPPERSDTPLHFNFFRDGEVIL 237
771 GTGAAACACAGCTTCCTCCAGAGCGGTCAGACACCCCACTTCACTTCAACTTCTTCAGAGATGGCGAGGTCATCCTG

SDWSTYPELQLPTVWRENSGSYWCGA 263
848 TCAGACTGGAGCACGTACCCGGAACTCCAGCTCCCAACCGTCTGGAGAGAAAACTCAGGATCCTATTGGTGTGGTGC

ETVRGNIHKHSPSLQIHVQRIPVSGV 289
925 TGAAACAGTGAGGGGTAACATCCACAAGCACAGTCCCTCGCTACAGATCCATGTGCAGCGGATCCCTGTGTCTGGGG

LLETQPSGGQAVEGEMLVLVCSVAE 314
1002 TGCTCCTGGAGACCCAGCCCTCAGGGGGCCAGGCTGTTGAAGGGGAGATGCTGGTCCTTGTCTGCTCCGTGGCTGAA

GTGDTTFSWHREDMQESLGRKTQRSL 340
1079 GGCACAGGGGATACCACATTCTCCTGGCACCGAGAGGACATGCAGGAGAGTCTGGGGAGGAAAACTCAGCGTTCCCT

RAELELPAIRQSHAGGYYCTADNSYG 366
1156 GAGAGCAGAGCTGGAGCTCCCTGCCATCAGACAGAGCCATGCAGGGGGATACTACTGTACAGCAGACAACAGCTACG

PVQSMVLNVTVRETPGNRDGLVAAG 391
1233 GCCCTGTCCAGAGCATGGTGCTGAATGTCACTGTGAGAGAGACCCCAGGCAACAGAGATGGCCTTGTCGCCGCGGGA

ATGGLLSALLLAVALLFHCWRRRKSG 417
1310 GCCACTGGAGGGCTGCTCAGTGCTCTTCTCCTGGCTGTGGCCCTGCTGTTTCACTGCTGGCGTCGGAGGAAGTCAGG

VGFLGDETRLPPAPGPGESSHSICPA 443
1387 AGTTGGTTTCTTGGGAGACGAAACCAGGCTCCCTCCCGCTCCAGGCCCAGGAGAGTCCTCCCATTCCATCTGCCCTG

QVELQSLYVDVHPKKGDLVYSEIQT 468
1464 CCCAGGTGGAGCTTCAGTCGTTGTATGTTGATGTACACCCCAAAAAGGGAGATTTGGTATACTCTGAGATCCAGACT

TQLGEEEEANTSRTLLEDKDVSVVYS 494
1541 ACTCAGCTGGGAGAAGAAGAGGAAGCTAATACCTCCAGGACACTTCTAGAGGATAAGGATGTCTCAGTTGTCTACTC

EVKTQHPDNSAGKISSKDEES* 515
1618 TGAGGTAAAGACACAACACCCAGATAACTCAGCTGGAAAGATCAGCTCTAAGGATGAAGAAAGTTAAGAGAATGAAA
1695 AGTTACGGGAACGTCCTACTCATGTGATTTCTCCCTTGTCCAAAGTCCCAGGCCCAGTGCAGTCCTTGCGGCACCTG
1772 GAATGATCAACTCATTCCAGCTTTCTAATTCTTCTCATGCATATGCATTCACTCCCAGGAATACTCATTCGTCTACT
1849 CTGATGTTGGGATGGAATGGCCTCTGAAAGACTTCACTAAAATGACCAGGATCCACAGTTAAGAGAAGACCCTGTAG
1926 TATTTGCTGTGGGCCTGACCTAATGCATTCCCTAGGGTCTGCTTTAGAGAAGGGGGATAAAGAGAGAGAAGGACTGT
2003 TATGAAAAACAGAAGCACAAATTTTGGTGAATTGGGATTTGCAGAGATGAAAAAGACTGGGTGACCTGGATCTCTGC
2080 TTAATACATCTACAACCATTGTCTCACTGGAGACTCACTTGCATCAGTTTGTTTAACTGTGAGTGGCTGCACAGGCA
2157 CTGTGCAAACAATGAAAAGCCCCTTCACTTCTGCCTGCACAGCTTACACTGTCAGGATTCAGTTGCAGATTAAAGAA
2234 CCCATCTGGAATGGTTTACAGAGAGAGGAATTTAAAAGAGGACATCAGAAGAGCTGGAGATGCAAGCTCTAGGCTGC
2311 GCTTCCAAAAGCAAATGATAATTATGTTAATGTCATTAGTGACAAAGATTTGCAACATTAGAGAAAAGAGACACAAA
2388 TATAAAATTAAAAACTTAAGTACCAACTCTCCAAAACTAAATTTGAACTTAAAATATTAGTATAAACTCATAATAAA
2465 CTCTGCCTTTAAATAAAAAAAAAAAAAAAAAAAAA
FIGURE 18B-1



IRTA2A 
IRTA2C 
IRTA2B 1 

81 

161 

241 

321 

401 



CGGTGCAGTGTCCTGACTGTAAGATCAAGTCCAAACCTGTTTTGGAATTGAGGAAACTTCTCTTTTGATCTCAGCCCTTG 
MLLWVILLVLAPVSGQFARTPR 22

GTGGTCCAGGTCTTCATGCTGCTGTGGGTGATATTACTGGTCCTGGCTCCTGTCAGTGGAC^ 

FLQPPWTTVF 0GERVTLTCKr7rn^ / , o 
GCCCATTATTTTCCTCCAGCCTCCATGGACCACAGTCTTCCAAGGAGAGAG^ 49 

YSPQKTKWYHRYLGKEILRETPDMTT ^ 
TCTACTCACCACAGAAAACAAAATGGTACCATCGGTACCTTGGGAAAGAAATACTAAGAGAAACCCCAGA 

EVQESGEYRCQAQGSPLSSPVHLDFSS 102

GAGGTTCAGGAATCTGGAGAGTACAGATGCCAGGCCCAGGGCTCCCCTCTCAGTAGCCCTGTGCACTTGGATTTTTCTTC 
ASLILQAPL SVFEGDSVVLRCRAKAPU ioq 

AG ST^S GCMGCTCCAOTTC ™^^ 



^gp^gragJ IYKNDNV LAFLNKRTDFHIP m 155 
4 5 1 TAACACTH3A*TO&^ 

ACLKDNGAYRCTGYKESCCPVSSNTVK 182
561 GCATGTCTCAAGGACAATGGTGCATATCGCTGTACTGGATATAAGGAAAGTTGTTGCCCTGTTTCTTCCAATACAGTCAA

IQVQEPFTRPVLRASSFQPISGNPVTL 209
641 AATCCAAGTCCAAGAGCCATTTACACGTCCAGTGCTGAGAGCCAGCTCCTTCCAGCCCATCAGCGGGAACCCAGTGACCC

TCETQLSLERSDVPLRFRFFRDDQTL 235
721 TGACCTGTGAGACCCAGCTCTCTCTAGAGAGGTCAGATGTCCCGCTCCGGTTCCGCTTCTTCAGAGATGACCAGACCCTG

GLGWSLSPNFQITAMWSKDSGFYWCKA 262
801 GGATTAGGCTGGAGTCTCTCCCCGAATTTCCAGATTACTGCCATGTGGAGTAAAGATTCAGGGTTCTACTGGTGTAAGGC

ATMPHSVISDSPRSWIQVQIPASHPVL 289
881 AGCAACAATGCCTCACAGCGTCATATCTGACAGCCCGAGATCCTGGATACAGGTGCAGATCCCTGCATCTCATCCTGTCC

TLSPEKALNFEGTKVTLHCETQEDSL 315
961 TCACTCTCAGCCCTGAAAAGGCTCTGAATTTTGAGGGAACCAAGGTGACACTTCACTGTGAAACCCAGGAAGATTCTCTG



RTLYRFYHEGVPLRHKSVRCERGASIS 342
1041 CGCACTTTGTACAGGTTTTATCATGAGGGTGTCCCCCTGAGGCACAAGTCAGTCCGCTGTGAAAGGGGAGCATCCATCAG

FSLTTENSGNYYCTADNGLGAKPSKAV 369
1121 CTTCTCACTGACTACAGAGAATTCAGGGAACTACTACTGCACAGCTGACAATGGCCTTGGCGCCAAGCCCAGTAA^rTr:

SLSVTVPVSHPVL gflPllll S PEDLI FEGA 395
1201 TGAGCCTCTCAG^^

KVTLHCEAQRGSLPILYQFHHEDAALE 422
1281 AAGGTGACACTTCACTGTGAAGCCCAGAGAGGTTCACTCCCCATCCTGTACCAGTTTCATCATGAGGATGCTGCCCTGGA

RRSANSAGGVAISFSLTAEHSGNYYCT 449
1361 GCGTAGGTCGGCCAACTCTGCAGGAGGAGTGGCCATCAGCTTCTCTCTGACTGCAGAGCATTCAGGGAACTACTACTGCA

ADNGFGPQRSKAVSLSITVPVSHPVL 475
1441 CAGCTGACAATGGCTTTGGCCCCCAGCGCAGTAAGGCGGTGAGCCTCTCCATCACTGTCCCTGTGTCTCATCCTGTCCTC

TLSSAEALTFEGATVTLHCEVQRGSPQ 502
1521 ACCCTCAGCTCTGCTGAGGCCCTGACTTTTGAAGGAGCCACTGTGACACTTCACTGTGAAGTCCAGAGAGGTTCCCCACA

ILYQFYHEDMPLWSSSTPSVGRVSFSF 529
1601 AATCCTATACCAGTTTTATCATGAGGACATGCCCCTGTGGAGCAGCTCAACACCCTCTGTGGGAAGAGTGTCCTTCAGCT

SLTEGHSGNYYCTADNGFGPQRSEVV 555
1681 TCTCTCTGACTGAAGGACATTCAGGGAATTACTACTGCACAGCTGACAATGGCTTTGGTCCCCAGCGCAGTGAAGTGGTG

SLFVTVPVSRPILTLRVPRAQAVVGDL 582
2A,2C 1761 AGCCTTTTTGTCACTGTTCCAGTGTCTCGCCCCATCCTCACCCTCAGGGTTCCCAGGGCCCAGGCTGTGGTGGGGGACCT

GKCWVLASHPPLAEFSLTHSFK 582
2B 1761 GGTAAGTGCTGGGTTCTTGCCAGTCACCCACCCCTGGCTGAGTTCTCTCTCACCCATTCCTTTAA

LELHCEAPRGSPPILYWFYHEDVTLGS 609
2A,2C 1841 GCTGGAGCTTCACTGTGAGGCCCCGAGAGGCTCTCCCCCAATCCTGTACTGGTTTTATCATGAGGA

NLFALSSFLP* stop 592
2B 1841 AAATCTGTTTGCACTGTCCAGTTTCCTC



SSAPSGGEASF B|f|f|i^ L T A E H S G 




C E 635 



2 A , 2C1 92 1 GCAGCTCAGCCCCCTCTGGAGGAGAAGCTTCTTT<?AA^^ 
2B 1921 GTTTTCCGTACTCATAAGTCCTGGCTCAGCCAGACCCCTAAAACAGCTCAGTAGATTCCCCAGCTTTTACCAAATGA^ 

ANNGLVAQHSDTISLSVIVPVSRPILT 662
2A,2C 2001 GCCAACAATGGCCTAGTGGCCCAGCACAGTGACACAATATCACTCAGTGTTATAGTTCCAGTATCTCGTCCCATCCTCAC
2B 2001 TATTTATTGTATTTTCTCCTCATTCCTTGTATGTTCCAACAGTACGCCAATTTTTCTTGATGCA

FRAPRAQAVVGDLLELHCEALRGSSPI 689
2A,2C 2081 CTTCAGGGCTCCCAGGGCCCAGGCTGTGGTGGGGGACCTGCTGGAGCTTCACTGTGAGGCCCTGAGAGGCTCCTCCCCAA
2B 2081 TCTCTACTGACATTTACATATTAACTTAGCTACAAGCACAGTCTTATAGATAAATATTGGTCAAGACCTTAAATTCTCCA
FIGURE 18B-2

LYWFYHEDVTLGKISAPSGGGASFNL 715
2A,2C 2161 TCCTGTACTGGTTTTATCATGAAGATGTCACCCTGGGTAAGATCTCAGCCCCCTCTGGAGGAGGGGCCTCCTTCAACCTC
2B 2161 AAGGATTTCCAATCTTATGGTAGATTTGGAGAAAGCTGCTGGTGAACAAAGGGGGAAATGGCTCCCTAGGAACCAACTCC

SLTTEHSGIYSCEADNGLEAQRSEMVT 742
2A,2C 2241 TCTCTGACTACAGAACATTCTGGAATCTACTCCTGTGAGGCAGACAATGGTCTGGAGGCCCAGCGCAGTGAGATGGTGAC
2B 2241 TCAAACTTCTGGAGTTTTTATGATCCCTTGTTTTCTAACCTGCTAAAATCAGTATCATTTTATTGTATTATTTTAAAAAA

LKVAVPVSRPVLTLRAPGTHAAVGDLL 769
2C 2321 ACTGAAAGTTGCAGTTCCGGTGTCTCGCCCGGTCCTCACCCTCAGGGCTCCCGGGACCCATGCTGCGGTGGGGGACCTGC

GEWALPTSSTSEN* 759
2A 2321 GGTGAGTGGGCCCTGCCCACCAGCAGCACATCTGAGAACTGACTGTGCCTGTTCTCCCTGCAGCTGA
2B 2321 ACTATTGTTGAAGTATGACATACATTCAAGAAACGTGTGCAAATTGTATGTGTACGATTTGGTGTCTTTTTAGGAGCTAA

ELHCEALRGSPLILYRFFHEDVTLGN 795
2C 2401 TGGAGCTTCACTGTGAGGCCCTGAGAGGCTCTCCCCTGATCCTGTACCGGTTTTTTCATGAGGATGTCACCCTAGGAAAT
2A 2401 AAATGGAGCCACAGAGCTCCTCAGGGCTGTTTGCTTGTGTGGCATCCCAGCACACTTCCTGCCTGCAGAACCTCCCTGTG
2B 2401 GTTGCTTCTGTTTTTACTTGAATCTTTGTTTATAGAAACTGGGGGAAAGTTTACTTTCTTT^

RSSPSGGASLNLSLTAEHSGNYSCEAD 822
2C 2481 AGGTCGTCCCCCTCTGGAGGAGCGTCCTTAAACCTCTCTCTGACTGCAGAGCACTCTGGAAACTACTCCTGTGAGGCCGA
2A 2481 AAAGTCTCGGATCCTTTGTGGTATGGTTCCAGGAATCTGATGTTTCCCAGCAGTCTTCTTGAAGATGATCAAAGCACCTC
2B 2481 TGATAGAAAAATCTTGAGCCTGATGTGTCAGACATGCCCCTAGCATAACTTGTTGAGTAAAGAGGTTATTTTTAAAATGT

NGLGAQRSETVTLYITGLTANRSGPFA 849
2C 2561 CAATGGCCTCGGGGCCCAGCGCAGTGAGACAGTGACACTTTATATCACAGGGCTGACCGCGAACAGAAGTGGCCCTTTTG
2A 2561 ACTAAAAATGCAAATAAGACTTTTTTAGAACATAAACTATATTCTGAACTGAAATTATTACATGAAAATGAAACCAAAGA
2B 2561 GAATGTTCTGAGACTACTCCAAAGTCAGAGCCAAATCTACTAGGAAGCTTCTAGACTTCACTCATTCTGCATCCCATTAC

TGVAGGLLSIAGLAAGALLLYCWLSR 875
2C 2641 CCACAGGAGTCGCCGGGGGCCTGCTCAGCATAGCAGGCCTTGCTGCGGGGGCACTGCTGCTCTACTGCTGGCTCTCGAGA
2A 2641 ATTCTGAGCATATGTTTCTCTGCCGTAGAAAGGATTAAGCTGTTTCTTGTCCGGATTCTTCTCTCATTGACTTCTAAGAA
2B 2641 TATCTTTTTATCCATGTTTTACTTTCTTCTCATATTCAGCAGCATCTTAAGCCTCTTTATTTTCTGTTTCTTGACTGTCA

KAGRKPASDPARSPSDSDSQEPTYHNV 902
2C 2721 AAAGCAGGGAGAAAGCCTGCCTCTGACCCCGCCAGGAGCCCTTCAGACTCGGACTCCCAAGAGCCCACCTATCACAATGT
2A 2721 GCCTCTACTCTTGAGTCTCTTTCATTACTGGGGATGTAAATGTTCCTTACATTTCCACATTAAAAATCCTATGTTAACGA
2B 2721 CCCTTAATGCCAGTAGAATGTAAGCTTCATGAGAACAGAACTGCATCCATCTTGGTCTTCACAACATCCCTGTGCCTACT

PAWEELQPVYTNANPRGENVVYSEVRI 929
2C 2801 ACCAGCCTGGGAAGAGCTGCAACCAGTGTACACTAATGCAAATCCTAGAGGAGAAAATGTGGTTTACTCAGAAGTACGGA
2A 2801 AAAAA
2B 2801 CAGTGTTTGGCACACAGTAGGTCCTCAGTCAACATTTGTAATTTAGTGGACAGATGATATGACAAGATGATAAGAGGGGA

IQEKKKHAVASDPRHLRNKGSPIIYS 955
2C 2881 TCATCCAAGAGAAAAAGAAACATGCAGTGGCCTCTGACCCCAGGCATCTCAGGAACAAGGGTTCCCCTATCATCTACTCT

2B 2881 TTTAAAAAJ\ATCATCTAGCAAAGCCCAAGAGGAAAAAAA 

EVKVASTPVSGSLFLASSAPHR* stop 977

2C 2961 GAAGTTAAGGTGGCGTCAACCCCGGTTTCCGGATCCCTGTTCTTGGCTTCCTCAGCTCCTCACAGATGAGTCCACACGTC 

2 B 2961 AGAATAGATTGGATATCTTTGAAAACCATTAATTGAATG 

2C 3041 TCTCCAACTGCTGTTTCAGCCTCTGCACCCCAAAGTTCCCCTTGGGGGAGAAGCAGCATTGAAGTGGGAAGATTTAGGCT

2 B 3041 AGATACAGAAATAAAGGCAAAAGTTATAATATGX3AAATCAGACAATC 

2C 3121 GCCCCAGACCATATCTACTGGCCTTTGTTTCACATGTCCTCATTCTCAGTCTGACCAGAATGCAGGGCCCTGCTGGACTG

2B 3121 AATGGAGACCCTCAGAAAATTGAACCGAAGAGTAJVAATGAAACTCAAAAATGTAGTAGAA^ 

2C 3201 TCACCTGTTTCCCAGTTAAAGCCCTGACTGGCAGGTTTTTTAATCCAGTGGCAAGGTGCTCCCACTCCAGGGCCCAGCAC 

2B 3201 ACTTGAATATGTAGATCAGAACATATATGTTGATGACGTTATTGACTTTGAGGTTAAAAATATATATATGTGCCTATGAT 

2C 3281 ATCTCCTGGATTCCTTAGTGGGCTTCAGCTGTGGTTGCTGTTCTGAGTACTGCTCTCATCACACCCCCACAGAGGGGGTC 

2B 3281 TATGGGGAAAAAAGCAGTCGTCTCAGAAAGAAAAACATCAAGTTAGTCTTAGACTTTGCAGTGCACTCAGTACCAAAGAG

3361 TTACCACACAAAGGGAGAGTGGGCCTTCAGGAGATGCCGGGCTGGCCTAACAGCTCAGGTGCTCCTAAACTCCGACACAG

3441 AGTTCCTGCTTTGGGTGGATGCATTTCTCAAT^^ 

3521 CACACAGCCTGTGCACATGGGACATGTGATGGGTCTCCCCACGGGGGCTGCATTTCACACTCCTCCACCTGTCTCAAACT

3601 CTAAGGTCGGCACTTGACACCAAGGTAACTTCTCTCCTGCTCATGTGTCAGTGTCTACCTGCCCAAGTAAGTGGCTTTCA

3681 TACACCAAGTCCCGAAGTTCTTCCCATCCTAACAGAAGTAACCCAGCAAGTCAAGGCCAGGAGGACCAGGGGTGCAGACA

3761 GAACACATACTGGAACACAGGAGGTGCTCAATTACTATTTGACTGACTGACTGAATGAATGAATGAATGAGGAAGAAAAC

3841 TGTGGGTAATCAAACTGGCATAAAATCCAGTGCACTCCCTAGGAAATCCGGGAGGTATTCTGGCTTCCTAAGAAACAACG

3921 GAAGAGAAGGAGCTTGGATGAAGAAACTGTTCAGCAAGAAGAAGGGCTTCTTCACACTTTTATGTGCTTGTGGATCACCT

4001 GAGGATCTGTGAAAATACAGATACTGATTCAGTGGGTCTGTGTAGAGCCTGAGACTGCCATTCTAACATGTTCCCAGGGG 
FIGURE 18B-3



4081 

«* ^^^^^^^^^^^^^^ 

A*\ ATTC^AGTTGTTCTCCCATCTCTGAGCAATGGGATGTTCTGTTCCGCTTTTATGATATCCATCACATC^ATCT 

4641 
4721 

till SI^I"^™^^ 
50n 



nil aSc£^ 

:CACAGCAC 
iAAAGTAGC 
CCCTCTCC 
TCTTGATC 

caacctttcaaggag^ccagacacctgagacaggagcctgtatgcagcccagtccagccto^ 

™ ggca t tctcatcactaca ^^^ 

TTAGnrrTAGTTGAAAGTCAAGAAGGACAGCAGCAAGCATAGGCTCAGGATTAAAGAAAAA^TCTCCTCACJ 
JACATCACCAACAAAGCTCACGCCCTATGCAGTTCTGAGAAGGTGGAGGCACCAGGCTCAAAAGi 



5121 GC^G T GG G CTG^ 

PTGCTGGC 

CTACAGTCACAAGAGCTAATTCATTCTAATAAACCAATCTTTC ~— ..—.^JICMCCT 



5201 

5281 ™r ^1^°^"^^ 



t 
FIGURE 18C-1

1 AGTGAAGGGGTTTCCCATATGAAAAATACAGAAAGAATTATTTGAATACTA
52 GCAAATACACAACTTGATATTTCTAGAGAACCCAGGCACAGTCTTGGAGAC
103 ATTACTCCTGAGAGACTGCAGCTGATGGAAGATGAGCCCCAACTTCTAAAA
154 ATGTATCACTACCGGGATTGAGATACAAACAGCATTTAGGAAGGTCTCATC
205 TGAGTAGCAGCTTCCTGCCCTCCTTCTTGGAGATAAGTCGGGCTTTTGGTG

256 AGACAGACTTTCCCAACCCTCTGCCCGGCCGGTGCCCATGCTTCTGTGGCT
1 MLLWL

307 GCTGCTGCTGATCCTGACTCCTGGAAGAGAACAATCAGGGGTGGCCCCAAA
6 LLLILTPGREQSGVAPK

358 AGCTGTACTTCTCCTCAATCCTCCATGGTCCACAGCCTTCAAAGGAGAAAA
23 AVLLLNPPWSTAFKGEK

409 AGTGGCTCTCATATGCAGCAGCATATCACATTCCCTAGCCCAGGGAGACAC
40 VALICSSISHSLAQGDT

460 ATATTGGTATCACGATGAGAAGTTGTTGAAAATAAAACATGACAAGATCCA
57 YWYHDEKLLKIKHDKIQ

511 AATTACAGAGCCTGGAAATTACCAATGTAAGACCCGAGGATCCTCCCTCAG
74 ITEPGNYQCKTRGSSLS

562 TGATGCCGTGCATGTGGAATTTTCACCTGACTGGCTGATCCTGCAGGCTTT
91 DAVHVEFSPDWLILQAL

613 ACATCCTGTCTTTGAAGGAGACAATGTCATTCTGAGATGTCAGGGGAAAGA
108 HPVFEGDNVILRCQGKD

664 CAACAAAAACACTCATCAAAAGGTTTACTACAAGGATGGAAAACAGCTTCC
125 NKNTHQKVYYKDGKQLP

715 TAATAGTTATAATTTAGAGAAGATCACAGTGAATTCAGTCTCCAGGGATAA
142 NSYNLEKITVNSVSRDN

766 TAGCAAATATCATTGTACTGCTTATAGGAAGTTTTACATACTTGACATTGA
159 SKYHCTAYRKFYILDIE

817 AGTAACTTCAAAACCCCTAAATATCCAAGTTCAAGAGCTGTTTCTACATCC
176 VTSKPLNIQVQELFLHP

868 TGTGCTGAGAGCCAGCTCTTCCACGCCCATAGAGGGGAGTCCCATGACCCT
193 VLRASSSTPIEGSPMTL

919 GACCTGTGAGACCCAGCTCTCTCCACAGAGGCCAGATGTCCAGCTGCAATT
210 TCETQLSPQRPDVQLQF

970 CTCCCTCTTCAGAGATAGCCAGACCCTCGGATTGGGCTGGAGCAGGTCCCC
227 SLFRDSQTLGLGWSRSP

1021 CAGACTCCAGATCCCTGCCATGTGGACTGAAGACTCAGGGTCTTACTGGTG
244 RLQIPAMWTEDSGSYWC

1072 TGAGGTGGAGACAGTGACTCACAGCATCAAAAAAAGGAGCCTGAGATCTCA
261 EVETVTHSIKKRSLRSQ

1123 GATACGTGTACAGAGAGTCCCTGTGTCTAATGTGAATCTAGAGATCCGGCC
278 IRVQRVPVSNVNLEIRP

1174 CACCGGAGGGCAGCTGATTGAAGGAGAAAATATGGTCCTTATTTGCTCAGT
295 TGGQLIEGENMVLICSV

1225 AGCCCAGGGTTCAGGGACTGTCACATTCTCCTGGCACAAAGAAGGAAGAGT
312 AQGSGTVTFSWHKEGRV

1276 AAGAAGCCTGGGTAGAAAGACCCAGCGTTCCCTGTTGGCAGAGCTGCATGT
329 RSLGRKTQRSLLAELHV

1327 TCTCACCGTGAAGGAGAGTGATGCAGGGAGATACTACTGTGCAGCTGATAA
346 LTVKESDAGRYYCAADN

1378 CGTTCACAGCCCCATCCTCAGCACGTGGATTCGAGTCACCGTGAGAATTCC
363 VHSPILSTWIRVTVRIP

1429 GGTATCTCACCCTGTCCTCACCTTCAGGGCTCCCAGGGCCCACACTGTGGT
380 VSHPVLTFRAPRAHTVV

1480 GGGGGACCTGCTGGAGCTTCACTGTGAGTCCCTGAGAGGCTCTCCCCCGAT
397 GDLLELHCESLRGSPPI

1531 CCTGTACCGATTTTATCATGAGGATGTCACCCTGGGGAACAGCTCAGCCCC
414 LYRFYHEDVTLGNSSAP

1582 CTCTGGAGGAGGAGCCTCCTTCAACCTCTCTCTGACTGCAGAACATTCTGG
431 SGGGASFNLSLTAEHSG

1633 AAACTACTCCTGTGATGCAGACAATGGCCTGGGGGCCCAGCACAGTCATGG
448 NYSCDADNGLGAQHSHG

1684 AGTGAGTCTCAGGGTCACAGTTCCGGTGTCTCGCCCCGTCCTCACCCTCAG
465 VSLRVTVPVSRPVLTLR

1735 GGCTCCCGGGGCCCAGGCTGTGGTGGGGGACCTGCTGGAGCTTCACTGTGA
482 APGAQAVVGDLLELHCE

1786 GTCCCTGAGAGGCTCCTTCCCGATCCTGTACTGGTTTTATCACGAGGATGA
499 SLRGSFPILYWFYHEDD

1837 CACCTTGGGGAACATCTCGGCCCACTCTGGAGGAGGGGCATCCTTCAACCT
516 TLGNISAHSGGGASFNL

1888 CTCTCTGACTACAGAACATTCTGGAAACTACTCATGTGAGGCTGACAATGG
533 SLTTEHSGNYSCEADNG
FIGURE 18C-2

1939 CCTGGGGGCCCAGCACAGTAAAGTGGTGACACTCAATGTTACAGGAACTTC
550 LGAQHSKVVTLNVTGTS

1990 CAGGAACAGAACAGGCCTTACCGCTGCGGGAATCACGGGGCTGGTGCTCAG
567 RNRTGLTAAGITGLVLS

2041 CATCCTCGTCCTTGCTGCTGCTGCTGCTCTGCTGCATTACGCCAGGGCCCG
584 ILVLAAAAALLHYARAR

2092 AAGGAAACCAGGAGGACTTTCTGCCACTGGAACATCTAGTCACAGTCCTAG
601 RKPGGLSATGTSSHSPS

2143 TGAGTGTCAGGAGCCTTCCTCGTCCAGGCCTTCCAGGATAGACCCTCAAGA
618 ECQEPSSSRPSRIDPQE

2194 GCCCACTCACTCTAAACCACTAGCCCCAATGGAGCTGGAGCCAATGTACAG
635 PTHSKPLAPMELEPMYS

2245 CAATGTAAATCCTGGAGATAGCAACCCGATTTATTCCCAGATCTGGAGCAT
652 NVNPGDSNPIYSQIWSI

2296 CCAGCATACAAAAGAAAACTCAGCTAATTGTCCAATGATGCATCAAGAGCA
669 QHTKENSANCPMMHQEH

2347 TGAGGAACTTACAGTCCTCTATTCAGAACTGAAGAAGACACACC
686 EELTVLYSELKKTHP

2398 CTCTGCAGGGGAGGCTAGCAGCAGAGGCAGGGCCCATGAAGAAG
703 SAGEASSRGRAHEED

2449 AGAAAACTATGAGAATGTACCACGTGTATTACTGGCCTCAGACC
720 ENYENVPRVLLASDH

2500 CCTTACCCAGAGTGGCCCACAGGAAACAGCCTGCACCATTTTTTTTTCTGT 
2551 TCTCTCCAACCACACATCATCCATCTCTrCAf; ArTrwrr 



:tagcc 
FIGURE 18D-1

1 TGGTGACCAAGAGTACATCTCTTTTCAAATAGCTGGATTAGGTCCTCATGC
1 ML

52 TGCTGTGGTCATTGCTGGTCATCTTTGATGCAGTCACTGAACAGGCAGATT
19 LWSLLVIFDAVTEQADS

103 CGCTGACCCTTGTGGCGCCCTCTTCTGTCTTCGAAGGAGACAGCATCGTTC
36 LTLVAPSSVFEGDSIVL

154 TGAAATGCCAGGGAGAACAGAACTGGAAAATTCAGAAGATGGCTTACCATA
53 KCQGEQNWKIQKMAYHK

205 AGGATAACAAAGAGTTATCTGTTTTCAAAAAATTCTCAGATTTCCTTATCC
70 DNKELSVFKKFSDFLIQ

256 AAAGTGCAGTTTTAAGTGACAGTGGTAACTATTTCTGTAGTACCAAAGGAC
87 SAVLSDSGNYFCSTKGQ

307 AACTCTTTCTCTGGGATAAAACTTCAAATATAGTAAAGATAAAAGTCCAAG
104 LFLWDKTSNIVKIKVQE

358 AGCTCTTTCAACGTCCTGTGCTGACTGCCAGCTCCTTCCAGCCCATCGAAG
121 LFQRPVLTASSFQPIEG

409 GGGGTCCAGTGAGCCTGAAATGTGAGACCCGGCTCTCTCCACAGAGGTTGG
138 GPVSLKCETRLSPQRLD

460 ATGTTCAACTCCAGTTCTGCTTCTTCAGAGAAAACCAGGTCCTGGGGTCAG
155 VQLQFCFFRENQVLGSG

511 GCTGGAGCAGCTCTCCGGAGCTCCAGATTTCTGCCGTGTGGAGTGAAGACA
172 WSSSPELQISAVWSEDT

562 CAGGGTCTTACTGGTGCAAGGCAGAAACGGTGACTCACAGGATCAGAAAAC
189 GSYWCKAETVTHRIRKQ

613 AGAGCCTCCAATCCCAGATTCACGTGCAGAGAATCCCCATCTCTAATGTAA
206 SLQSQIHVQRIPISNVS

664 GCTTGGAGATCCGGGCCCCCGGGGGACAGGTGACTGAAGGACAAAAACTGA
223 LEIRAPGGQVTEGQKLI

715 TCCTGCTCTGCTCAGTGGCTGGGGGTACAGGAAATGTCACATTCTCCTGGT
240 LLCSVAGGTGNVTFSWY

766 ACAGAGAGGCCACAGGAACCAGTATGGGAAAGAAAACCCAGCGTTCCCTGT
257 REATGTSMGKKTQRSLS

817 CAGCAGAGCTGGAGATCCCAGCTGTGAAAGAGAGTGATGCCGGCAAATATT
274 AELEIPAVKESDAGKYY

868 ACTGTAGAGCTGACAACGGCCATGTGCCTATCCAGAGCAAGGTGGTGAATA
291 CRADNGHVPIQSKVVNI

919 TCCCTGTGAGAATTCCAGTGTCTCGCCCTGTCCTCACCCTCAGGTCTCCTG
308 PVRIPVSRPVLTLRSPG

970 GGGCCCAGGCTGCAGTGGGGGACCTGCTGGAGCTTCACTGTGAGGCCCTGA
325 AQAAVGDLLELHCEALR

1021 GAGGCTCTCCCCCAATCTTGTACCAATTTTATCATGAGGATGTCACCCTTG
342 GSPPILYQFYHEDVTLG

1072 GGAACAGCTCGGCCCCCTCTGGAGGAGGGGCCTCCTTCAACCTCTCTTTGA
359 NSSAPSGGGASFNLSLT

1123 CTGCAGAACATTCTGGAAACTACTCCTGTGAGGCCAACAACGGCCTGGGGG
376 AEHSGNYSCEANNGLGA

1174 CCCAGTGCAGTGAGGCAGTGCCAGTCTCCATCTCAGGACCTGATGGCTATA
393 QCSEAVPVSISGPDGYR

1225 GAAGAGACCTCATGACAGCTGGAGTTCTCTGGGGACTGTTTGGTGTCCTTG
410 RDLMTAGVLWGLFGVLG

1276 GTTTCACTCGTGTTGCTTTGCTGTTGTATGCCTTGTTCCACAAGATATCAG
427 FTGVALLLYALFHKISG

1327 GAGAAAGTTCTGCCACTAATGAACCCAGAGGGGCTTCCAGGCCAAATCCTC
444 ESSATNEPRGASRPNPQ

1378 AAGAGTTCACCTATTCAAGCCCAACCCCAGACATGGAGGAGCTGCAGCCAG
461 EFTYSSPTPDMEELQPV

1429 TGTATGTCAATGTGGGCTCTGTAGATGTGGATGTGGTTTATTCTCAGGTCT
478 YVNVGSVDVDVVYSQVW

1480 GGAGCATGCAGCAGCCAGAAAGCTCAGCAAACATCAGGACACTTCTGGAGA
495 SMQQPESSANIRTLLEN

1531 ACAAGGACTCCCAAGTCATCTACTCTTCTGTGAAGAAATCATAACACTTGG
512 KDSQVIYSSVKKS

1582 AGGAATCAGAAGGGAAGATCAACAGCAAGGATGGGGCATCATTAAGACTTG
1633 CTATAAAACCTTATGAAAATGCTTGAGGCTTATCACCTGCCACAGCCAGAA
1684 CGTGCCTCAGGAGGCACCTCCTGTCATTTTTGTCCTGATGATGTTTCTTCT
1735 CCAATATCTTCTTTTACCTATCAATATTCATTGAACTGCTGCTACATCCAG
1786 ACACTGTGCAAATAAATTATTTCTGCTACCTTCTCTTAAGCAATCAGTGTG
1837 TAAAGATTTGAGGGAAGAATGAATAAGAGATACAAGGTCTCACCTTCATCT
1888 ACTGTGAAGTGATGAGAACAGGACTTGATAGTGGTGTATTAACTTATTTAT
1939 GTGCTGCTGGATACAGTTTGCTAATATTTTGTTGAGAATTTTTGCAAATAT
1990 GTTCATTGGGAATATTGGCCTGAAATTTTCTTTTCCACTGTGTCTCTGCCA
2041 GAATGTTTGTATCAGGCTGATGCTGGCTTCATAGAATGAGTTAGGCAGGAG
2092 CCCTTCCTCCTTGATTTTTTGGCATAGTTTCAGCAGGATTGGTACCAGTTA
2143 TTCTTTCTGCATCTTGTAGAATTCAGCTATGAATCCATCTGGTCTAGGGCT
2194 TTTGTGTTGGTTGGTAAGTTTTTTATTACTAATTCAACTTCAGCGCTTGAT
2245 ATTGGTCTAGGAGGGGTTTCTGTCTCTTCCTGGTTCAATCTTGGGAGATTG
2296 TGTGTTTCCAGGAATTTAGCCGTTTCCTCCAGATTTTCTTCTTTATGTGCA
2347 TCGACTTGAGTGTAAACATAACTTATATGCACTGGGAAACCAAAAAATCTG
2398 TGTGACTTGCTTTATTGCAGCATTTGTTTTATTTTGGTAGTCTGGAACTGA
2449 ACCTGCAATATCACCAAAGTATGCATATAGTTGCAAAAATGTGATTTTTGA
2500 CATAGTAAATATGAGTATTTGCAATAAACTATGATATTACTTTTGTAAGTA
2551 TATAGAATAAAATGTAAATAATCTATAAAA
FIGURE 18E-1

1 GAGGCATCTCTAGGTACCATCCCTGACCTGGTCCTC

37 ATGCTGCCGAGGCTGTTGCTGTTGATCTGTGCTCCACTCTGTGAA
MLPRLLLLICAPLCE

82 CCTGCCGAGCTGTTTTTGATAGCCAGCCCCTCCCATCCCACAGAG
PAELFLIASPSHPTE

127 GGGAGCCCAGTGACCCTGACGTGTAAGATGCCCTTTCTACAGAGT
GSPVTLTCKMPFLQS

172 TCAGATGCCCAGTTCCAGTTCTGCTTTTTCAGAGACACCCGGGCC
SDAQFQFCFFRDTRA

217 TTGGGCCCAGGCTGGAGCAGCTCCCCCAAGCTCCAGATCGCTGCC
LGPGWSSSPKLQIAA

262 ATGTGGAAAGAAGACACAGGGTCATACTGGTGCGAGGCACAGACA
MWKEDTGSYWCEAQT

307 ATGGCGTCCAAAGTCTTGAGGAGCAGGAGATCCCAGATAAATGTG
MASKVLRSRRSQINV

352 CACAGGGTCCCTGTCGCTGATGTGAGCTTGGAGACTCAGCCCCCA
HRVPVADVSLETQPP

397 GGAGGACAGGTGATGGAGGGAGACAGGCTGGTCCTCATCTGCTCA
GGQVMEGDRLVLICS

442 GTTGCTATGGGCACAGGAGACATCACCTTCCTTTGGTACAAAGGG
VAMGTGDITFLWYKG

487 GCTGTAGGTTTAAACCTTCAGTCAAAGACCCAGCGTTCACTGACA
AVGLNLQSKTQRSLT

532 GCAGAGTATGAGATTCCTTCAGTGAGGGAGAGTGATGCTGAGCAA
AEYEIPSVRESDAEQ

577 TATTACTGTGTAGCTGAAAATGGCTATGGTCCCAGCCCCAGTGGG
YYCVAENGYGPSPSG

622 CTGGTGAGCATCACTGTCAGAATCCCGGTGTCTCGCCCAATCCTC
LVSITVRIPVSRPIL

667 ATGCTCAGGGCTCCCAGGGCCCAGGCTGCAGTGGAGGATGTGCTG
MLRAPRAQAAVEDVL

712 GAGCTTCACTGTGAGGCCCTGAGAGGCTCTCCTCCAATCCTGTAC
ELHCEALRGSPPILY

757 TGGTTTTATCACGAGGATATCACCCTGGGGAGCAGGTCGGCCCCC
WFYHEDITLGSRSAP

802 TCTGGAGGAGGAGCCTCCTTCAACCTTTCCCTGACTGAAGAACAT
SGGGASFNLSLTEEH

847 TCTGGAAACTACTCCTGTGAGGCCAACAATGGCCTGGGGGCCCAG
SGNYSCEANNGLGAQ

892 CGCAGTGAGGCGGTGACACTCAACTTCACAGTGCCTACTGGGGCC
RSEAVTLNFTVPTGA

937 AGAAGCAATCATCTTACCTCAGGAGTCATTGAGGGGCTGCTCAGC
RSNHLTSGVIEGLLS

982 ACCCTTGGTCCAGCCACCGTGGCCTTATTATTTTGCTACGGCCTC
TLGPATVALLFCYGL

1027 AAAAGAAAAATAGGAAGACGTTCAGCCAGGGATCCACTCAGGAGC
KRKIGRRSARDPLRS

1072 CTTCCCAGCCCTCTACCCCAAGAGTTCACCTACCTCAACTCACCT
LPSPLPQEFTYLNSP

1117 ACCCCAGGGCAGCTACAGCCTATATATGAAAATGTGAATGTTGTA
TPGQLQPIYENVNVV

1162 AGTGGGGATGAGGTTTATTCACTGGCGTACTATAACCAGCCGGAG
SGDEVYSLAYYNQPE

1207 CAGGAATCAGTAGCAGCAGAAACCCTGGGGACACATATGGAGGAC
QESVAAETLGTHMED

1252 AAGGTTTCCTTAGACATCTATTCCAGGCTGAGGAAAGCAAACATT
KVSLDIYSRLRKANI

1297 ACAGATGTGGACTATGAAGATGCTATGTAA 1326
TDVDYEDAM *

GGTT ATGGAAGATT CTGCTCTTTG 
1351 AAAACCATCC ATGACCCCAA GCCTCAGGCC TGATATGTTC TTCAGAGATC 
1401 CTGGGGCATT AGCTTTCCAG TATACCTCTT CTGGATGCCA TTCTCCATGG 
1451 CACTATTCCT TCATCTACTG TGAAGTGAAG TTGGCGCAGC CCTGAAGAAA 
1501 CTACCTAGGA GAACTAATAG ACACAGGAGT GACAGGGACT TTGTTATCAG 
1551 AACCAGATTC CTGCCGGCTC CTTTGAAAAC AGGTCATATT GTGCTCTTCT 
1601 GTTTACAAGA GGAAACAAGA TGGAATAAAA GAAATTGGGA TCTTGGGTTG 
1651 GAGGGACAGT GAAGCTTAGA GCACATGAAC TCAAGGTTAG TGACTCTGCA 
1701 GGACTTCACA GAGAGAGCTG TGCCCATCAT TCAGTCCAAG TGCTTTCTCT 
1751 GCCCAGACAG CACAGAACTC CAGCCCCGCT ACTTACATGG ATCATCGAGT 
1801 TTCCACCTAA AATATGATTC TATTTATTTT GAGTCACTGT TACCAAATTA 



FIGURE 18E-2 



1851 GAACTAAAAC AAAGTTACAT AAAAAGTTAT TGTGACTCCA CTTAATTTTA
1901 GTGACGTATT TTTGTATATA TAGGCCAACC TATACCACAT CCAAAATTAT
1951 GTATCTATTA CAGCCCCTAG AAGCTTTATA AATACAGTGT GTCTTCTTTT
2001 ATTCACAAAA TTTTTGAAAT CGTGGTAATA TGGTTTGAAA CCTGTATCTT
2051 AATTATTTTT TTTTTAAATT GAGACAGGGT CTCACTCTGT CACTCAATCT
2101 GGAATGCAGT GGCACAATCT TGCCTCACTG CAACGCCTGC CTCTCAGGCT
2151 CAAGCAAACC TCTCACCTCA GCCTGCTGAG TAGCTGGGAC TACAGGCACA
2201 TGCCACCAAA CTTGGCCATT TTTTGTCTTA CGTAGAGACA AGATTTCACC
2251 GTTTTGCCCA GGCTGGTCTC AAACTCCTGG GCTCAAGCAA TGTATTGAAT
2301 TTT



